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Nov^i ket^ods for ii-;e identification ce ligan? add target biomolecules 
F1F1D OF THE INVENTION 

The present invention pertains to a novel method for the identi- 
fication/preparation of peptides or ribonucleic acids capable of 
5 modulating the activity in vivo of target enzymes in eukaryotic 
cells. More specifically, the invention provides a method for 
-i Hnn t i f i c.=* t -i on /oreoar a t ion of hitherto unknown enhancers as well 
as inhibitors of in vivo enzyme activity in eukaryotic cells. 
Furthermore, the invention relates to methods for identification 

10 of unknown interactions (i.e. identification of a target and/or 
a ligand but also of hitherto unknown interactions between known 
ligands and known targets) . These novel methods employ enzyme 
inhibitor structures as scaffolds in order to int race 1 lula r 1 y 
present potentially biologically active peptides or ribonucleic 

15 acids in a stable form. Also disclosed herein are methods for 
the preparation of the hitherto unknown ligands or targets as 
well as methods for the preparation of vectors and transformed 
cells carrying the genetic information encoding these ligands 
and targets. Finally, the invention relates to a method for the 

20 preparation of a medicinal product which is based on initial 
identification of targets or ligands according to the present 
invent ion . 

BACKGROUND OF THE INVENTION 

The Cell Screen technique is a method that allows for the iden- 
25 tification of peptide sequences having biological activity in 
vivo and is disclosed in WO 96/38553. In short, libraries of 
random peptides are expressed int racel lular ly in mammalian 
cells, such that one cell expresses one single or a few 
heterologous short peptides. Cells that change a preselected 
30 phenotype under certain conditions can be isolated and the 
peptide that they express can hence be identified. The 
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intracellular component with which the peptide interacts (the 
target molecule) may subsequently be obtained using e.g. affini- 
ty columns carrying the immobilized synthetic peptide. 

Although the CellScreen technology has shown great promise for 
5 identifying new drug targets, it is an inherent problem that the 
intracellular environment is relatively hostile to many expres- 
sion products which are expressed. In other words, interesting 
peptide or nucleic acid sequences which potentially are capable 
of interacting with an important target molecule may be degraded 
10 or inactivated inside the cell before any effect on phenotype 
can be detected. 

OBJECT OF THE INVENTION 

It is an object of the invention to provide improvements in the 
CellScreen technology by overcoming the above-mentioned problems 
15 of potential instability of expressed sequences. Furthermore, it 
is an object of the invention to expand the utility of the 
CellScreen technology to also encompass screening in prokaryotic 
eel Is . 

SUMMARY OF THE INVENTION 

20 A significant number of enzyme activity modulators of plant, 
microbial and eukaryotic cell origin have been described, cf . 
below . 

Since many of the naturally occurring processes inside cells are 
regulated by enzymes, the inventors disclose herein a method for 
25 expression of large intracellular libraries of such enzyme 
activity modulators in which the active site of said enzyme 
activity modulators have been altered by introduction of 
stretches of randomized amino acid sequences or by introduction 
of random nucleotides ac specific sites in the active site. This 
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cro ,;os libraries of putative icdulatcrs capable of modulating 
the actrviry of an array of different enzymes inside cells. By 
expressing these modulators in cell lines according * o a novel 
variation of the CellScreen technology the enzymatic regulatory 
5 mechanisms inside the individual ced in said cell line will be 
affected differently leading to different pnenotypic properties 
such as e.g. resistance towards hypoglycemia, cytokine killing, 
toxic compounds, virus infection etc. 

The main advantage of using known enzyme activity modulators 

1C such as enzyme inhibitors as scafiolds is that many of these in 
their native form are stable in the intracellular environment. 
The problem of using e.g. antibody fragments as scaffolds for 
intracellular presentation of random peptides is that many such 
antioody fragments are susceptible to the proteolytic and redu- 

15 cing intracellular environment and therefore such molecules can, 
if carefully tailored, maintain their intracellular stability 
and at the same time incorporate rancom sequences which are 
screened for biological activity. Furthermore, the effectivity 
of such a screen for biologically active substances will be 

20 higher than if using an unstable scaffold (such as a e.g. a 

coilec coil structure) or no scaffold at all, since none or only 
a very limited number of the randomized sequences will be de- 
graded before they can exert their effects in the ceils. Fi- 
nally, a majority cf such enzyme activity modulators have an 

25 active site which is the perfect position in the molecule to 

modify, since the active site is normally presented in a stable 
configuration to the environment. 

Hence, one aspect of the invention pertains to a method 
for identifying an in vivo active modulator of activity of a 
30 target enzyme, the method comprising the steps of (a) preparing 
a pool of expression vectors, each vector cf said pool contain- 
ing at least one member from a library of randomly modified 
nucleotide sequences derived from a parent nucleotide sequence 
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encoding a pa rem peot .. i do or parent ribonucleic acid which 
reflates the target enzyce activity, (b) transforming a popula- 
ii3>n of substantially identical cells with said vectors of said 
pool so as to obtain transformed cells, said substantially 
5 identical cells being ones which harbour the target enzyme, (c) 
culturing said transformed cells under conditions facilitating 
expression of said randomly modified nucleotide sequences, (d) 
examining said transformed cells and isolating transformed 
cell ( S ) wherein the activity of the target enzyme is modulated, 
1C and (e) determining said randomly modified nucleotide sequence 
of said vector present in cell(s) isolated in step (d) and/or 
determining the amino acid sequence or the ribonucleic acid 
sequence of the expression product encoded by said randomly 
modified nucleotide sequence. 

15 In this part of the invention it is normally preferred that the 
randomly modified nucleotide sequences consist of 1) an invari- 
able part of the parent nucleotide sequence, and 2) random 
nucleotides. In line with the above, the invariaoie part of tne 
parent nucleotide sequence preferably encodes a scaffold portion 

20 of the parent peptide or of the parent ribonucleic acid which 
serves to stabilize said polypeptide fragment or ribonucleic 
acia fragment. 

As mentioned above, the use of enzyme modulator scaffolds also 
provides for expression of stable, biologically active modal a - 
25 tors which interact with other bionoiecules than enzymes. In 

other words, in cases where random nucleotides are inserted in a 
scaffold structure, the outcome will be a stable expression 
oroduct, the activity of which does not necessarily nave any- 
thing to do wi~h enzyme activity modulation. 

30 Hence, another part of the invention is a method for identifying 
a modulator in the form of a biologically active polypeptide 
fragment or ribonucleic acid fragment which is capable of de- 




tec-: ably :i; o '.iuldLi n g , in vivo, a phenotypic trait in a cell, the 
method c or: prising the ste p s o t 

( a ) creuar ing a pool c f expression vectors, each vector of said 
pool containing at least one member from a library of randomly 

5 modified nucleotide sequences derived from a parent nucleotide 
sequence encoding a parent peptide or parent ribonucleic acid 
which in vivo modulates activity of a known enzyme, wherein the 
randomly modified nucleotide sequences comprise 
an invariable part encoding a scaffold portion of the parent 
1C peptide or of the parent ribonucleic acid, said scaffold portion 
serving to stabilize said polypeptide fragment or ribonucleic 
acid fragment, and 
- random nucleotides, 

(b) transforming a population of substantially identical ceils 
15 with said vectors of said pool so as to obtain transformed 

cells, 

(c) culturinq said transformed cel_s under conditions facilita- 
ting expression of said randomly modified nucleotide sequences, 

(c) examining said transformed cells and isolating transformed 
20 cell (sj wherein the preselected phenotypic trait is modulated by 
the presence of the expressed randomly modified nucleotide 
sequence, and 

(e) determining said randomly modified nucleotide sequence of 
said vector present in cell(s) isolated in step (d) and/or 
25 determining the amino acid sequence or the ribonucleic acid 
sequence of the expression product encoded by said randomly 
modified nucleotide sequence. 

Finally, the invention also pertains to the general use of 
int race! iularly stable scaffold proteins, ribonucleotides, or 
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fragxents thereof for the presentation of randor. sequences in 
the CeilScreen technology. As xer.tioned above, although the 
concept of using scaffold xoleeules has been discussed in the 
prior art, the issue of the stability of the scaffold system has 
5 not oeen detailed. 

The stability and usefulness of a putative intracellular scaf- 
fold is dependent on a nunoer of factors. First of all, it is 
essential that the relevant cell wherein the scaffold is to be 
expressed is capable cf expressing the scaffold molecule in a 

1C functional form; that is, in prokaryotic systems some eukaryotic 
proteins will not fold correctly, hence rendering the use of 
such a protein unsuitable as a scaffold in that type of cell. 
Second, the scaffold should be relatively resistant to the 
reducing and catalytic environxent inside intact cells. However, 

15 even when a scaffold molecule is relatively susceptible to the 
inactivating nature of the intracellular environment, this can 
be remedied if the production rate of the scaffold molecule is 
sufficiently high. 

In steady state, the intracellular concentration of a scaffold 
20 molecule will be a function of the following formula: 

c - ^ 

^scaffold - fij 

-where R D is the rate of production of the scaffold molecule 
(moles - s" 1 ) and R a is the inactivation constant for the scaffold 
molecule (1 * s" : ), i.e. the rate of inactivation of the scaffold 

25 molecule is determined by = R d C scajfold (which, in steady state, 

of course equals ?L . In other words, when assessing the suitabi- 
lity of a potential scaffold molecule, it should according to 
the present invention be tested whether the molecule can be kept 
at a sufficiently high concentration inside the relevant cell 
30 wherein the CeilScreen test is going to be carried out. 
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Therefore, a v.-r/ broad aspect cf the invention certains to a 
method fcr identifying a modulator : r. the torn of a biologically 
active polypeptide fragment or ribonucleic nciu fragment, which 
13 caoacie of de \ cc t a:.:. 1 y mcdu 1 a t l ng , in vivo, a phenotypic trait 
5 of a cell., the ne t h oo ccirp risi n g t n e c t e p s o f 

[a) preparing a pool t f expression vectors, each vector of said 
oool containing at least one member from a library of randomly 
r > v i - •• p:1 naclecnide seauer.ces derived fro- a parent nucleotide 
sequence encoding a parent peptide or parent ribonucleic acid 
10 which is stable int lacel luia r ly , wnerein the randomly modified 
nucleotide sequences comprise 

an invariable part encoding a scaffold portion of the 
parent perfide or of the parent ribonucleic acid, said 
scaffold portion serving to stabilize said polypeptide 
IS fragment or ribonucleic acid fragment, and 

- random nucleotides, 

(o) transforming a population of substantially identical cells 
with said vectors of said pool so as to ootam transformed 
cells, 

20 (c) culturing said transformed ceils under conditions facilita- 
ting expression of said randomly modified nucleotide sequences, 

(a) examining said transformed cells ana isolating transformed 
cell (s) wherein the preselectec phenotypic trait is modulated by 
the presence of the expressed randomly modified nucleotide 
2 5 sequence , and 

(e) determining said randomly modified nucleotide sequence of 
said vector oresent in cell (s) isolated in step (d) and/or 
determining the amino acid sequence or the ribonucleic acid 
sequence of the expression product encoded by said randomly 
3C modified nucleotide sequence, 
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wherein the expression product: of the nucleic acid sequence 
which encodes the int r acel lu 1 a r 1 y stable parent peptide or 
oarent ribonucleic acid is one which, when produced by the 
substantially identical cells, is present in an effective con- 
5 cent rat i on and in a functional state. 

In other words, it is essential that the suitability of the 
scaffold molecule is evaluated prior to performing the steps of 
the CellScreen technology in order to confirm that the scaffold 
molecule in unmodified form can be expressed and maintained at a 
10 sufficiently high concentration/activity in the cellular system 
where the method of the invention is to be exercised. 

According to WO 96/38553, the isolation of a drug target mole- 
cule can be made more efficient if the random peptide sequences 
are inserted into larger polypeptides functioning as scaffolds 
15 for display of the random amino acid sequences. Such scaffolds 
would probably also leaa to higher affinity interaction with the 
target molecule. 

Nothing is, however, mentioned about the use of scaffolds de- 
rived from naturally occurring protein inhibitors of enzymes. 
20 Inhibition of enzymatic activity by such inhibitors - as opposed 
to the simple binding of a target protein inside a cell as was 
suggested in the CellScreen technology - is a much more effi- 
cient way to affect intracellular biochemical events. 

DETAILED DISCLOSURE OF THE INVENTION 

2 5 Definitions 

In the following, a number of terms will be defined for the 
purposes of the present disclosure: 
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A "modulator" is in the present context a biomolecule which, 
when expressed in vivo, to effect the activity of another biomo- 
lecule in the cell. Thus, the modulator in essence can inhibit 
or enhance the activity of the biomolecule. Furthermore, the 
b modulator can interact directly with the biomolecule, but the 
effect might as well be indirect (also known as t ransdomina n t ) , 
i.e. the activity change of the biomolecule is brought about by 
changes in the cell's biochemical machinery, changes which are 
ultimately the result of the presence of the expression product 
10 of the randomized nucleic acid sequence. 



A ^randomly modified nucleotide sequence" is a nucleotide se- 
quence which in a number of positions has been subjected to 
insertion or substitution by nucleotides, the nature of which 
cannot be predicted. In many cases the random nucleotides or 

15 nucleotide sequences inserted will ce "completely random" (e.g. 
as a consequence of randomized synthesis or PCR-mediated 
mutacenesis) - However, as will appear from the disclosure below, 
the random sequences can also induce sequences which have a 
common functional feature (e.g. reactivity with a ligand of the 

20 expression product) or the random sequences can be random in the 
sense that the ultimate expression product is of completely 
random sequence with e.g. an even distribution of the different 
amino acids . 

"Substantially identical cells" is a term herein intended to 
25 designate cells which all exhibit a specific phenotypic trait in 
such a manner that a change in the expression of said trait in 
one cell due to an interaction effected by the introduction of 
random nucleotides according to the inventive methods would also 
occur in one cf the other substantially identical cells had 
30 these been transformed with the same vector. In other words, the 
important parameter to assess when choosing substantially iden- 
tical cells in the inventive methods is whether an observed 
change in one cell's exhibition of the phenotypic trait can be 
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taken as an i-dicatir;n that any ether cell in the peculation 
would have oenavea the same way as a consequence of the sane 
change. Hence, substantially identi-ai ceils can be clonal cells 
or cells of a cell line or they can be cells of a cell culture 
5 or a tissue culture. 

A "pher.otyoic trait" is the observable result of a certain gene 
composition in a cell (genotype), i.e. a property of a cell 
(detected by cherr.ical, physical, immuno legi ca 1 or any other 
suitaole means) which depends on the presence of one or several 
10 genes and the expression rate thereof. Thus, the pnenotypic 

trait can be any of a number of different properties: activity 
of an enzyme, effects of interaction between receptors and 
Iigands, cell survival rate, presence or absence of an antigen, 
expression rate, etc. 

15 "Peptide" is in the present context intended to mean both short 
peptides of from 2 tc 10 amino acid residues, oligopeptides of 
from 11 to ICO amino acid residues, and polypeptides of more 
than 100 amino acid residues. Furthermore, the term is also 
intended to include proteins, i.e. functional biomolecules 

2C comorising at least one polypeptide; when comprising at least 
two polypeptides, these may form complexes, be covalently lin- 
ked, or may be non-cova lent 1 y linked. The pol ypept ide ( s ) in a 
protein can be glycosylated and/or lipidated and/or comprise 
prosthetic groups . 

25 "In vivo" is herein mean to designate the environmental condi- 
tions inside living ceils (i.e. ceils which are met abolicaUy 
active and can maintain their vital functions); the living cells 
may be kept in culture or may be present in a natural habitat 
(e.g. functioning as part of a larger, multicellular organism). 

30 Thus, the term "in vivo" also refers to in vitro culturing of 
cells as long as the effect being observed is taking place in 
the living eel 1 . 



"Transformation": A process oy which the genetic material car- 
ried by an individual cell is altered oy incorporation of exc 
genous ON A into its genome. 

"Tra.nsfection" : The uptake, incorporation, and expression of 
5 recombinant DNA by cells. 



^Trar.sGucuon": The transfer of genetic information from one 
cell t-o another by way of a viral vector. 

The term "effective part" when used in the context of a protein, 
peptide, or ribonucleic acid is in the present context intended 

10 to mean a part (e.g. a subsequence or, in the case of an n-meric 
protein, a les s-z han-n-mer ic molecule) which has retained the 
desired functionality of the native nolecuie from which the 
protein or peptide is derived. For instance, in the case of 
CT-2A only the truncated form of the molecule described in 

15 Example 1 seems to be necessary to ensure expression of the 

active inhibitor int racel lular ly , and hence the truncated form 
of that molecule constitutes an effective part of CI-2A. 



G eneral orooerties of the inventive method 

In general, the disclosures in WO 96/38553 and WO 97/27212 
20 relating 

to preparation cf randomized sequences, to the choice of 
fusion partners (except for the choice of scaffolds) for 
the random sequences, to the choice and composition of 

25 targeting sequences, to the choice of nucleic acids to be 

randomly modified, to the choice of randomization method, 
to the methods of introducing the nucleic acids into the 
relevant cell type, to the choice of (retroviral) vectors 
(where applicable) , to the method of producing the vectors, 

3C the choice of promoters, to the choice of packaging cells 



[where applicable), to :he methods of ccncer.:ra:ing infec- 
tious virions frcr, the packaging ceils (where applicable), 
to the choice of substantially identical ceils to use in 
the xethed, to the type of pnenotypic changes detected, to 
5 the manner in which the change is detected, to methods of 

isolating the phenot ypi ca 1 1 y changed ceils, to the isola- 
tion and sequence determination of the randomly modified 
sequences, to the isolation and characterization of the 
target for the randomized product, to screening methods, 
10 ana to the choice of applications of the methods 

also are relevant for the purposes of the present invention. 
Therefore, the disclosures of these two patent applications are 
hereby incorporated by reference herein. However, since both of 
these references are focussed on the use of the general prin- 

15 cipie in higher eukaryctic cells, the present disclosure will 

also detail on embodiments pertaining tc the use in prokaryotic 
systems, cf . below. However, the icre general part of the two 
above-referenced disclosures which without any difficulty for 
the skilled person could be applied in the context of a prokary- 

20 otic system are also regarded as relevant and important embodi- 
ments of the present invention insofar as it relates to the use 
of the methods in prokaryotic systems. 

It is normally preferred that the transformed ceils which are 
being examined in step (d) predominantly carries (and expresses) 

25 one single copy of the vector. By ensuring this, the interpreta- 
tion of a change in phenotype of the cells becomes a much easier 
task, whereas the interpretation of a phenotype change in cells 
expressing more than one single randomized sequence renders 
unclear which of the transforming vectors is responsible for the 

30 change. 

To ensure that predominantly one vector has transformed each of 
the cells examined it is e.g. feasible that the transformation 
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s:e p (o) is pertorr-.eJ under such conc_tror;s t;io ' ^ - - - 

transformed are precominanr'.y or at most transformed with one 
:;inc le vector fro- said pool (trus o 3:. e.g. oe achieved by 



.d just l.nq 



■ e r t r a t i o n of infect i o o s v i r i o n s i n embodi ir.e n t : 



5 of the invention where the transformation is obtainou oy means 
, : ,f transduction), or wherein, prior to carrying out step (d) r 
cells beinq transformed with rr.ore than .)n- vector irorr. said pool 
are substantially excluded from the further steps. This latter 
option requires that iz is possible to quantify the number of 
10 transforming vectors and this car. bo achieved by including a 

detectable marker in the expression product, e.g. a flourescent 
probe. Another option is to rescreen cells which exhibit changes 
in phenotyoe, thereby ascertaining whether more than one vectcr 
has transformed the cell. 

15 It will be understood that the molecule chosen for the purpose 
of being a scaffold, and wnerein the random sequences are ulti- 
mately introduced, can be either a peptide sequence or a nucleic 
acid sequence, such as an RNA fragment interfering (in an 
antisense-manner) with mRNA, t RNA or with a ribczyme. Alterna- 

20 tively, such an RNA fragment could exhibit ribozyme activity 
itself, thereby having an indirect influence on the expression 
rate of other enzymes. At any rate, the resulting product, i.e. 
the randomized expression product, can be a peptide or a ribonu- 
cleic acid such as a ribozyne. 

25 One important feature of the scaffold is, as mentioned above, 
that it is stable towards proteolytic attack and/or is insensi- 
tive to a reducing environment, such as the one which is found 
ir.traceiluiarly. 

In preferred embodiments of the invention the random nucleotides 
30 are introduced in part(s) of the parent nucleotide sequence 
which encode (s) the active site(s) of the parent peptide or 
oarent ribonucleic acid, or the part(s) which encode (s) struc- 
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turets) interfering wit.:. Lie dc:ive site (s! . As discussed above, 
the active site (as well as other exposed structures cf the 
scaffold) need to oe stable presentee to the environment ir. 
order to be able to interact with other oiomolecuies . Hence, 
5 preferably the invariable part cf the nucleotide sequence en- 
codes truncated parts of the parent peptide cr parent ribonu- 
cleic acid sufficient to mainrair. stability of the randomized 
product . 

In some embodiments it is preferred that the invariable part of 

10 the parent nucleotide sequence encodes a peptide which is free 
fron disulphide bridges. This is, as mentioned in Example 2 
below, due to the fact that disulphide bridges are not formed in 
the nucleus or in the cytcsoi . Hence, in cases were the scaffold 
must be in a functional state when present in the nucleus or the 

15 cytosol, it would normally be preferable to use a scaffold which 
does not contain disulphide bridges cr which do not rely on 
these in order to maintain stability and functionality. On the. 
other hand, in embodiments where it is desired that the random- 
ized expression product is confined to the ER, or to another 

2C compartment allowing for the presence of stable disulphide 
bridges, before the randomized sequence is presented to the 
environment in a satisfactory manner, it is of course desirable 
that the invariable part of the parent nucleotide sequence 
encodes a peptide having disulphide bridges, oecause the chances 

25 of having a correctly folded and functional scaffold outside 
such compartments is relatively small. 

It will be understood that the random, nucleotides are preferably 
introduced in the form of an insertion or a substitution into 
the parent nucleotide sequence, optionally in combination with 
30 deletion (s) in the parent nucleotide sequence. Deleted sequences 
in the parent polypeptide could e.g. be parts of an active site, 
the presence of which in unaltered form is toxic or otherwise 
deleterious to the transformed cells. 



1 5 

The number of random nucleotides introduced can vary to a great 
degree cut normally the nua,er is between 3 and about 100. In 
this ranee it is preferred that at least 5, such as at least 7, 
and c-rter, at least 9-12 random nucieondes are introduces. Cn 
5 tr.e other hand, it is preferred that at most 90, such as at mc st 
70 or 8 0 random nuclec t iries arc introduced. The most preferred 
number oi introduced randomized nucleotides varies between 15- 
50, oreferably 20-55 or 25-50 nucleotides. Especially preferred 
numbers of random nucleotides are 18, 19, 20, 21, 22, 23, 24, 
1C 25, 2fc, 27, 23, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 
41, 42, 43, 44, 45, 46, 47, 43, 49, 50, 51, 52, 53, 54, 55, 56, 
57, 58, 59, and 60 nucleotides. 

The random nucleotides are introduced in :he scaffold in the 
form of nucleotide sequences and/or in the fcrm or single random 
15 nucleotides introduced at specific sites in the parent nucleo- 
tide sequence. A variation is to substitute a part of the scaf- 
fold sequence with a sequence which retains parts of the scaf- 
fold seauence (e.g. those which are essential for stabi- 
lity/functionality) but where other parts are randomized. 

20 The random nucleotides are preferaoly selected from, the group 
consisting of 

synthetic, completely random deoxyr ibonucleot ides ; 
synthetic random DNA sequences, wherein limitation on 
randomization of some nucleotides is introduced so as to 

25 limit the number of available sequences and/or to avoid 

undesired stop codons and/or to facilitate introduction of 
post-iranslational modifications of expressed peptide (s); 
synthetic random DNA sequences as in (1) or (2) coupled to 
a sequence encoding a purification tag; and 

30 - CDR encoding nucleotide sequences isolated from a library 
of immune-competent cells raised against an antigen (in 
this embodiment it is preferred that CDR encoding nucleo- 
tide sequences encode CDR-3 peptide sequences) . 
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The latter type of vv anc;om,7a:xon" actually introduces a re- 
stricticn on rando.xr.es s which ensures that the sequences intro- 
duced encodes an antigen recognizing region. It is, however, 
well known that a polyclonal immune response against an antigen 
5 consist of a large number of immune competent cells which ail 
react with the same antigen (or perhaps even with the same 
epitope) but the correlation between amino acid sequences of the 
CDRs and recognition of the epitope (s) is virtually impossible 
to deduce. 

10 Ar. alternative way no introduce limitations on the randomness of 
the nucleic acid sequences which are ultimately tested in the 
substantially identical cells is the following: Upon preparation 
of the vectors, they are used in a I s * round of phage display, 
where the phages transformed with the vectors are panned against 

15 a library containing a ligand of choice. As for the technique of 
employing CDR encoding sequences, the result is that the se- 
quences which are ultimately tested in the substantially identi- 
cal ceils are "unpredictable" (and thereby random) but neverthe- 
less selected on the basis of a functional feature. Again, the 

20 lack: of known correlation between nucleic acid sequences and the 
interaction in three-dimensional space between the expression 
product and a ligand of choice has the consequense that the 
tested subgroup of sequences still is randomized. 

In a special embodiment of the above -t echn i que where the method 
25 of the invention is combined witn phage display, both test 

systems are repeated in an alternating manner, that is a snuffl- 
ing between intracellular expression in the substantial identi- 
cal cells and panning of a phage library. 

In order to obtain an controlled distribution of amino acids in 
30 the randomized peptides, when the modulator is a peptide, it is 
practical that the random nucleotides are prepared by codon 
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so. synthesis where dc; D N A c : J d o r. 5 are synthesized in a 

ran : :r; order; a thorough description of this principle is given 
in WO 96/3 3 553. The preferred embodiment ;n this context is one 
wherein the relative amount of synthesized codons ens j re that 
5 ail encoded amine ac: as will oe present with substantially the 
same frequency i n e needed po - ypept ide fragment s . 

In order to introduce the rand;xized fragments properly into the 
vectors, it is according to me invention preferred that the 
random nucleotides are introduced into the expression vector by 
10 tee principle cf site directed PCR-mediated mutagenesis. How- 
ever, other options are known to the skilled person, and it is 
e.g. possible to insert synthetic random sequence libraries into 
the vectors as well. 

Aoart f rom having the random! zed f ragment of the expression 
15 product introduced into a scaffold in accordance with the pre- 
sent invent ion, it is often necessary to couple the random 
sequence to a fusion partner by having the randomized nucleotide 
sequence fused to a nucleotice sequence encoding v at least one 
fusion partner. Such a fusion partner can e.g. facilitate ex- 
20 pression and/cr pur i f i ca t i on / i so 1 a t ion and/or further stabili- 
zation of the expression product. 

For the purposes of purification, the fusion partner can include 
a purification tag such as His 6 tag, myc tag, BSP biotinylation 
target sequence, of BirA, flu tag, lacZ , and GST. Furthermore, 
2 5 the fusion partner may include a sorting signal or a targeting 
sequence, cf. the discussions below. 

I n embodiment s where the modulator is itself a modu 1 a t o r of 
enzyme activity, it is in theory possible to effect both the K x 
and/or the V ff£: , of the relevant enzyme. A reduction in K >: results 
30 in less effectivity of the relevant enzyme insofar that an 

increased substrate concentration is required to obtain 50% of 
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maximum activity of the orzyne. An increase of K, has the oppo- 
site effect. Of course, interference with an enzyme which ef- 
fects has as a consequence that the maximum possible rate of 
activity of the enzyme is increased (when ... is increased) or 
5 decreased (when is increased) . At any rate, tne modulator of 
the enzyrre activity will give the phenotypic impression that the 
enzyme activity nas either beer, inhibited or stimulated. It is 
preferred in this embodiment that tne modulator is an inhibitor. 

When the method of the invention has finally lead tc the identi- 
1C fica.ion of a modulator or of a target therefor, it is preferred 
that the 3 -dimens lona 1 structure of the identified modulator is 
resolved, since this allows for the implementation of rational 
drug screening and computer drug modelling methods. 

Use of the inventive methods in prckarvctic systems 

15 The originally envisaged technology disclosed in WC 96/38553 
focussed on screening for interactions in eukaryotic cells. 
However, the technology is alsc applicable in prokaryctic sys- 
tems . 

For instance, it is expected that the present invention will 
2C allow for identification cf hitherto unknown interactions in 
pathogenic bacteria, interactions which will be useful in the 
course of developing new antibiotics. Since the inventive me- 
thods allows for the identification of both novel ligand pep- 
tides and ribonucleic acids as well as of the target molecules 
25 for these ligands, the investigator is provided with the neces- 
sary tools for instigating computer drug modelling and for 
performing traditional drug screening, once such ligands and/or 
targets have been identified/isolated. 

However, apart from the approach of identifying antibacterial 
30 effects and substances, the method also opens up for improve- 

V1AF Bitvt-cVCc»Sciccn2 OK l-TK/lW-0"(> 



1 9 

ments in industrial fermentation processes. In sucl: cases it 
will e.g. be possible to identify biomolecules which are lnpor- 
tj;,:; in tie biochemical pathways in lactic acid bacteria and 
thereby orovide tools for the production of new dairy products 
b such as cheese, yoghcurt, and other products of lactic acid 
bacterial ferment at ion . 

Somewhat related to this approach is the use of the methods of 
the present invention in screening performed on bacterial cul- 
tures used m purification processes. It is well-known m the 

1C art of e.g. waste water purification that the microbiological 
cultures (activated sludge) which conduct the degradation of 
organic material, are relatively vulnerable vis a vis changes in 
the environment and therefore the provision of more robust 
strains of bacteria wouic oe one way to improve such sysiems. 

15 Alternatively, the method of the present invention would also 

allow for the identification/isolation of ligands and targets m 
such bacteria which, when interacted with, can lead to e.g. 
increased efficacy m degradation of specific organic or inor- 
ganic substrates. 

2C As will be appreciated from the above, the present invention 
therefore is highly useful in prokaryotic systems. 

Tor the ourposes of using the method of the invention in 
prokaryotic cells, it is preferred that the prokaryotic cells 
are bacteria selected from the group consisting of Bacillus spp. 

25 (e.g. B . anthracis, B. subtilis and B . cereus) , Clostridium spp. 
(e.g. C. botulinux, C. difficile, C. parfringens, and C. teta- 
ni) , Cory/2e£acteriu-m spp. (e.g. C. diphtheriae , and C. pyoge- 
nes), Staphylococcus spp. {e.g. S. aureus and S. albicans) , 
Streptococcus spp. (e.g. S. pneumoniae, S. pyogenes, and S. 

30 agalactiae) , Escherichia coli, Serratia marcescens, Klebsiella 
spp. (e.g. K. pneumoniae) , Proteus spp. (e.g. P. mirabilis) , 
Citrobacter spp. (e.g. CitrcJbacter freundii) , Salmonella spp. 
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(e.g. S . t Vior: i / 5 . typhimuri urr , S . s h ot tmuiieri ar.d 5 . paraty- 
phi). Shigella sop. (e.g. 5. dyszenter la e , S. flexr.eri, S. 
boydii, ar.d 5. sconei), Pseudomonas spp. (e.g. P. aeruginosa, P. 
pseudo/naliei, anc P. .^Jlei), Acinetoi?acter spp., Aeromonas 
S spo., Plesi omonas spp., Yersinia spp. (e.g. r*. pestis, Y . 

enterocolizica, and Y\ pseudotuberculosis) , Francisella tularen- 
sis, Vibrio spp. (e.g. V. cholerae and V. pa rahaemolyt icus) , 
Campylobacter spp. (e.g. C. jejuni and C\ col:), Helicobacter 
pylori, Haemophilus spp. (e.g. K. mfl uen;:ea , H . pa ra in £ luenzae r 

10 and rt. aegyptius) , Bordetella spp. (e.g. B. pertussis, B . 

parapertussis, and B . bronchi sept i ca ) , Brucella spp., Neisseria 
spp. (e.g. W. goiinorhoeae and tf. nieni ngi t i dis) , Treponema palli- 
dum, Leptospira interrogans; Borrelia spp. (e.g. 3 . burgdorferi 
sensu stricto, B. garinii, B . afzelii, and B. recurrent is) , 

15 Legionella pneumophila, Listeria monocytogenes, Mycobacterium 

spp. (e.g. M. tuberculosis, M. bovis, M. africanum, M . kansasii, 
and M. leprae) f Treponema pallidum, Chlamydia trachomatis, 
Actinomyces spp,, Rickettsia spp., and Mycoplasma spp. (e.g. M. 
pneumon iae) . 

20 This list of bacteria thus entails bacterial families and spe- 
cies which are involved in pathology of a large number cf dis- 
eases in humans. Of these, E. coli and B . subtilis are also used 
for industrial fermentation; this is also the case for lactic 
acid bacteria, notably Lactococcus spp. and Lactobacillus spp. 

25 and therefore it is also preferred -hat the inventive methods, 
when employed in bacterial systems, are performed on such non- 
pathogenic species . 

When using the inventive methods in a prokaryotic system, it is 
very often interesting :o identify those cells which are im- 
30 paired in growth or lethally damaged due to the presence of the 
Heterologous expression product introduced according to the 
invention. This, however, is not completely unprobiemat ic since 
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zhe main p h o r o tyoic trai. c s 



scciared e.g. bacterial death 



is absence of 



the bacter i urn . 



I- is therefore necessary to device an exper irr.er.tal so* jp which 
will allow identification o: cells transformed so as to be less 



5 good survivors than other otherwise corresponding cells. One 
advantage is chat expression of the heterologous genetic mate- 
rial is under the contr:)] or an inducible promoter. In this way 
it is possible to expand colonies of cells which have been 
transformed with genetic material which, when expressed, is 
10 lethal or g row t n- irr.pa i r ing to the cells. After that, the expres- 
sion can be switched on and careful examination of expanded 
colonies should reveal those clonal colonies which do not follow 
the same growth pattern as e.g. an unt rans formed control. 

One method of doing tnis is to spread transformed cells on 

lb plates with growth medium and allow the ceils to grow up to a 
pre-determined average size. The spreading of cells should be 
sucn rhat the visioie colonies forming will generally be com- 
prised on one single clone of cells. When the pre-aet ermmed 
size of colonies have been reached, all plates are blotted to a 

20 carrier medium so as to prepare a "negative" of each agar plate. 
After this, the express! on of the inserted random sequences is 
induced and the colonies are allowed to grow again. The plates 
are examined continuously or at suitable intervals (e.g. by 
means of digital image processing systems well known to the 

25 skilled person) . Those colonies which reveal an impaired or 
arrested growth compared to the remainder of the colonies or 
compared to controls are thereafter identified, since the growth 
oattern of each colony in an automated manner can be followed. 
These colonies can then be identified and isolated from the 

30 "negative" blct and it is thereafter a relatively simple proce- 
dure to extract the transforming genetic material and determine 
the sequence thereof. 



In this context, one interesting option is tc renaer antibiotic- 
resistant bacteria non-resistant. The growth medium either 
contains, cr is during culturir.g enriched with, the antibiotic 
in question and the colonies which upon induction of expression 
5 can be demonstrated to be less drug resistant than controls are 
examined further. Also pure bactericidai/bacteriostatic effects 
can be examined- In such an embodiment, the oacteria are e.g. 
cultured on a suitable growth medium. Those colonies which after 
induction of expression shows evidence of reduced or arrested 
1C growth are examined further: It is expected that some of the 
bacterial cells will be demonstrated to carry genetic material 
encoding an expression product which interacts with novel (or 
;<nown) targets for antibacterial agents. 

Another phenotypic trait of interest is of course superior 
15 survival of cells. It is, when dealing with utilisable bacteria, 
of interest to identify targets which will increase the survival 
rate of the bacteria. For example, bacteria used m industrial 
fermentations normally can be lethally aamaged as a consequence 
of their own uncontrolled production of heterologous expression 
20 products. If genes or target molecules can be identified which 
have a positive effect on the survival of such bacteria, the 
economic potential is enormous, since a fermentation process 
will be rendered more economic (less need to startup of new 
fermentatior-s). Similarly, bacteria used in e.g. waste water 
25 purification can be made more resistant against toxic agents in 
their er.viro nme n t . 

The experimental setup in this context is relatively simple: The 
transformed bacteria are simply subjected to the potentially 
lethal condition, and only colonies which exhibit a superior 
30 survival are isolated and examined (and that will typically be 
the colonies which are detectable) . A setup like the above- 
described for identifying cell death should thus not be neces- 
sary . 



Finally, a larqe group ct pheno:ypi: traits to be examinee are 
chose w.mcn can b e detected by e.g. b i : c h e m 1 c a 1 or 1 mm a n o 1 c g ; c a ] 
means. It is expected :hat :he net nod 01 the invent ion w:_ 
allow for identification of syster.s ir. bacterial ceils which, 
when properly modulated, can render the bacteria more effective 
as producers in industrial fermentation. Such pnenctypic traits 
could e.g. oe chances in enzyme activity, cnanges m receptor 
density, changes in expression rate etc. 



Special considerations apply when the randomized expression 
10 product is fused to a fusion partner wruch decides the final 

location of the expression product. Signal sequences in prokary- 
otes are well-known in the art, but it should briefly be men- 
tioned that rr.embrane-anchoring signals are known, and it is also 
possible to export the expression product to the periplasmic 
15 space of bacteria. Finally, it is also possible to include 

secretion signals so as :o allow the isolation of the expression 
product from culture supernatant. However, in many cases it is 
of course most relevant to keep the expression product inside 
the pr oka ryot ic cytoplasm . 



20 Use of the method in eukarvotic systems 

It is especially preferred that the inventive method utilises 
eukaryotic cells as the substantially identical cells in order 
to allow screening for active biomolecules . Hence, these eukary- 
otic cells can be fungal cells, protozoan cells, animal cells, 
25 and plant cells. 

As is the case for oacteria, a number of fungi are pathogens in 
mammals, and therefore the present technology will, in parallel 
with what has been described above concerning antibacterial 
agents, be useful for identifying antifungal agents by using 
30 pathogenic fungi as the substantial identical cells m the 
method. Furthermore, fungi (especially yeast strains), like 
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bacteria, are also utilised in fermentation processes (e.g. in 
the wine and brewing inaustries) , ana the method car. therefore 
also be utilised using such non-pathogenic fungi as the substan- 
tially identical, ceils which are transformed with the vectors, 
5 whereby improvements in these strains can be obtained. 

Preferred examples of fungi serving as the eukaryotic cell in 
the inventive methods are Kpidermophyton spp., Trichophyton 
spp., Micrcsporum spp., Candida albicans, Philophora spp., 
Coccidioides immitis, Histoplasma capsulatum, Blastomyces 
10 dermatitidis, Paracoccidioides brasiliensis, Crypcococcus 
neoformans, Aspergillus sop., Saccha romyces cerevisiae, 
Klyveromyces lactis, and Piccia pastoris. 

Unlike fungal cells, protozoan cells are only relevant as patho- 
gens for humans and other mammals, although some protozoans form 

15 part of biocultures conducting biological waste water purifica- 
tion. The method of the invention is therefore contemplated to 
be useful in identifying new targets for an t i proto zoan agents. 
In this context, the preferred protozoan ceiis used as the 
substantially identical cel. in the methods of the invention are 

2C selected from the group consisting of Giardia lamblia, Trichomo- 
nas vaginalis, Dientamoeba fragilis, Trypanosoma sop., Leishma- 
nia spp., Entamoeba histolytica, Naegleria fowleri, Acanthamoeba 
castellan!, Harmanella spp., Isospora belli, Cryptosporidium 
spp., Sarcocystis spp., Toxoplasma gondii, Plasmodium spp. [e.g. 

25 P. falciparum, P. vivax, P. malariae, P. knowlesi, and P. ova- 
le) , Babesia spp., and Balantidium coli. 

Also plant cells are according to the invention interesting as 
target eukaryotic cells. The plant cells can be any plant cell 
which can be subjected to genetic engineering techniques allow- 
30 ing for single ceil expression and growth. Thus, cells derived 
from e.g. Nicotiana tabacum (tobacco plant), Arabidopsis thali- 
ana, Brassica napus, Brassica juncea, Musa sp . (banana plants), 



rice, anc i corn are exanplos A plant cells useful in the lrven- 
:io:: . -no skilled person, in the art of pianr. genetic engineering 
will k::cw to choose suitable plar.t cells in th? appropriate 
stage cf their life cycle, suitable vector systems as well as 
5 suitable transformation methods. A short summary is given sere: 

As other organisms, plant cells can be transformed with 
foreign DNA. One strategy tor plant transformation employs 
Agrcbacterium tumefaciens, a naturally occurring plant 
pathogenic oactorium. which contains a plasmia (the T_ 

1C plasmid) with the ability to enter plant cells and insert a 

pert inn of its genome into plant chromosomes. The Ti plas- 
mid has been engineered to make it a vector for plant 
transformation by including sequences for replication in E. 
eeli and Agrobacterium, unique restriction sites for m- 

-5 sent mg foreign genes, and selectable markers. 

Unfortunately, Agrobacterium is net very effective at 
transforming monocors, a large group of plants tnat in- 
cludes all of the agriculturally important cereals. Le- 
gumes, another important group of lood crops, are also 

20 difficult to transform and regenerate with Agroba c:t en um . 

Therefore, a second strategy for transforming plants has 
been developed involving the Gene Gun, where a gene is 
inserted into an expression vector and coated onto beads. 
The DNA-coated oeads are then introduced by means of the 

25 gene gun into plant cells, where a small fraction are taken 

up and incorporated into the DNA. Individual plant cells, 
callus, regenerating snoots, and embryos are all suitable 
targets in this technique. 



30 



To determine whether cells have actually incorporated 
foreign DNA and become transgenic, reporter genes such as 
GUS and Luciferase genes are used. In any case, foreign 



cenes must be iianked by a plant promoter in order to be 
expressed . 

It is preferred to use cells derived from animals. These cells 
can be mammalian cells, arthropod cells such as insect cells, 
5 avian ceils, and piscine cells. A number of reasons can be 

listed for using such cell types which each require a relevant 
choice of transformation and expression systems, growth condi- 
tions, etc, all easily determined and chosen by the skilled 
person. It suffices to note that e.g. certain insects cause 

10 enormous problems in human society (due to their direct damaging 
activities or due to their functions as vectors carrying infec- 
tious agents), and therefore the method of tne invention would 
supplement in tne attempts of con-rolling such insects. As for 
the mammals, birds and fish, a number of these are important in 

15 agri- and aquaculture, where disease control is of interest. 

According to the invention mammalian cells such as human 
cells/cell clones or human cell lines are most preferred. This 
is due to the fact that a large number or diseases in humans and 
other mammals et i olog ica 1 ly depend on molecular interactions in 

20 the living cell - the provision of drugs or lead compounds which 
interact in vivc with biomolecules which play a role in diseases 
is therefore of great interest. Preferred mammalian cells are 
Chinese hamster ovary (CHO) cells, VERO cells, HeLa cells, W138 
cells, BHK cells, COS-7 293 cells, and MDCK cells, which are all 

25 well-known in the art. 



The vector is preferably selected from the group consisting of 
retroviral vector, a vaccinia virus vector, an adenoviral vec- 
tor, an adeno associated virus (AAV) vector, a herpes simplex 
virus (HSV) vector, an alpha virus vector, and a semliki forest 
30 virus vector. 
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The candidate nuclei:: acids are hence introduced into oukaryotic 
cells as part of a vectcr to screen for modulators of target 
enzyme act :vi:y. By the terra vv :n::oauced ntc" _ s herein meant 
that the nucleic acids enter the cells in a manner suitable fcr 
5 suoseqjen- expression zt the nucleic acid. The method of intro- 
duction is largely dictated by the targeted cell type, cf. 
below. Exemplary, out non- 1 imi t ing methods include CaPO. preci- 
pitation, liposome fusion, 1 ipcf ect in®, elect roporat ion , viral 
infection, etc. 

10 The randomly modified nucleic acids are preferably integrated 

into the nest cell genome (e.g. by means of retroviral infection 
of the host cell), cr may exist either transiently or stably in 
the cytoplasm (i.e. through the use of traditional plasmids 
utilizing standard regulatory sequences, selection markers, 

15 etc.). As many pha rma ceut ica 1 1 y important screens require human 
or model mammalian cell targets, retroviral vectors capable of 
transfecting such targets are preferred. 

In a preferred embodiment, the candidate nucleic acids are part 
of a retroviral virion which infects the cells. Generally, 

2C infection of the cells is straightforward with the application 
of the infection-enhancing reagent polybene. Infection can be 
optimized such that the cells predominantly express a single 
construct each, e.g. by using the ratio of virus particles to 
the number of cells. Alternatively, it is possible to "screen 

25 out'' cells which have been infected with more than one single 
virion, e.g. by quantitatively assessing a selection marker and 
only. The rate of infection is well-known to follow a Poisson 
distribution . 

A preferred embodiment of the invention where the substantially 
30 identical cells are eukaryotic thus comprises that step (a) is 
carried out by 



M&fi BioieeMTdlSircen: DK I -PK/UKS-OTOO 



trans feeding suir.able packaging cells witn vectors com- 
prising the randomly modified nucleotide sequences and 
which are mteg ratable in virions produced by said packag- 
ing cells, 

culturing said transfected packaging cells in a culture 
medium under conditions which facilitates production by 
the packaging cells of virions containing the randomly 
modified nucleotide sequences, 

recovering and optionally concentrating said virions, and 
10 (iv) transducing said substantially identical cells with the 



in 



Thus, the candidate nucleic acids are introduced into the ceils 
using retroviral vectors. Currently, the rr.ost efficient gene 
transfer methodologies harness the capacity of engineered vi- 

15 ruses, such as retroviruses, to bypass natural cellular barriers 
to exogenous nucleic acid uptake. The use is well-known in the 
art of helper-defective packaging cell-lines which are capable 
of producing all necessary trans proteins (gag r pol, and env) 
required for packaging, processing, reverse transcription, and 

20 integration of recombinant genomes. Those RNA molecules which 
have in cis a i f packaging signal are packaged into maturing 
virions. In eu^aryotes, retroviruses are preferred for a number 
of reasons. First, their derivation is fairly easy. Second, 
unlike Adenovi rus-mediat ed gene delivery, expression from retro- 

25 viruses is long-term (adenoviruses do not integrate) . Adeno- 
associated viruses have limited space for genes and regulatory 
units and there is some controversy as to their ability to 
integrate. Retroviruses therefore currently provide the best 
compromise in terms of long-term expression, genomic flexibility 

30 and stable integration, among other features. The main advantage 
of retroviruses is that their integration into the host genome 
allows for their stable transmission through cell division. This 
ensures that in cell types which undergo multiple independent 
maturation steps, such as hematopoietic cell progression, the 
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retrovirus construct will renair. resident and continue to ex- 
press . 

Prefer red retroviral vectors are icrivod iror, retrovirus se- 
lected from the group consisting of Avian Leukosis-Sarcoma Virus 
5 (ASLV) , Karrrr.alian type C, Mammalian type 3, and Lentivirus. They 
may all optionally be modified with heterologous cis-acting 
elements . 

It is preferred that the retroviral vector has non-identical 
ends so as to facilitate PCR-based generation of random DNA 
10 sequences. It is furthermore preferred that these non- ident ica 1 
ends contain non - ident ica 1 promoters. 

An especially preferred retroviral vector contains a heterolo- 
gous promoter replacing the viral promoter in the S'-LTR, such 
as a CMV promoter, an RSV promoter, ah SV-40 promoter, a TK 
15 oromoter, an MT promoter, or an inducible syszem such as Tet or 
Ecdysone. A suitable vector is described in WO 96/38553, cf. 
Fig. 1 and accompanying text. 

Particularly well suited retroviral trans feet ion systems are 
PE501 (US 4,861,719), Bosc23 (WO 94/19478), Psi2 (R. Kulligan/D. 

20 Baltimore), GP+E86 (US 5,273,056), PhoenixEco (WO 97/27212), 
PA317 (US 4,861,719), GP+AM12 (US 5,278,056), DA(ampho) 
(WC 95/1C601, WO 92/05266), Bing (WO 94/19478), FLYA13 
(WO 97/08330), ProPak (available from SyStemix) , CRI? (R. Mulli- 
gan), PsiAM (R. Mulligan/D. Baltimore), Phoenix-Ampho 

25 (WO 97/27212), PG13 (Targeted Genetics), H9 (293GPG) (D. Ory, M 
Sadelain, R- Mulligan, J. Schaffer) . 

In one embodiment of the present invention, the library is 
generated in a retrovirus DNA construct backbone, as is gene- 
rally described in e.g. WO 97/27212. Standard oligonucleotide 
30 synthesis is performed to generate the random portion of the 
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car.aiaate modulator, using :echr.:ques well known in the art (cf. 
Ec:<3-ein, Oligonucleotides and Analogues, A Practical Approach, 
IRL Press At. Oxford University Press, 1991); libraries may be 
commercially purchased. Libraries with up to 10' unique sequen- 
5 ces can be readily generated in such DHA backbon.es. Afier gene- 
ration of the DNA library, the library is cloned into a first 
primer. The first primer serves as a "cassette" which is inser- 
ted into the retroviral construct. The first primer generally 
contains a number of elements, including for example, the 

1C required regulatory sequences {e.g. translation, transcription, 
promoters, etc) , fusion partners, restriction endonuclease 
(cloning and subcloning) sites, stop codons (preferably in all 
-hree reading frames, regions cf complementarity for second 
strand priming (preferably at the end of the stop codon region 

15 as minor deletions or insertions may occur in the random re- 
gion) , etc. 

A second primer is then added, which generally consists of some 
or all of the complementarity region to prime the first primer 
and optional necessary sequences for a second unique restriction 
20 site for subcloning. DNA polymerase is added to make double- 
stranded oligonucleotides. The double-stranded oligonucleotides 
are cleaved with the appropriate subcloning restriction 
endonucieases and subclonea into the target retroviral vectors, 
described below . 



25 Any number of suitable retroviral vectors may be used. Gene- 
rally, the retroviral vectors may include: selectable marker 
genes under the control of internal ribcsoxe entry sites (IRES), 
which allows for bicistronic operons and thus greatly facili- 
tates the selection of cells expressing peptides at uniformly 

30 high levels; and promoters driving expression of a second gene, 
placed in sense or anti-sense relative no the 5 ' -LTR (long 
terminal repeat) . Suitable selection genes included, but are not 
limited to, neomycin, biastocidm, bleomycin, puromycin, and 
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hvgromycin resistance qenes, as well as self-flu ores c v. r \ t markers 
such as green fluorescent protein, enzymatic markers such as 
lazZ, ana surface proteins such as CDS, etc. 

Preferred vectors include vectors derived from retrovirus se- 
5 lettea from the group consisting of Avian I.eu kos i s-Sa r coma Virus 
( A3 IV ) jyamxalian type C, Mam.malian type B, and Lent i. vi rus as 
well as vectors derived from MSCV (murine stem cell virus), 
-rod-fled MFG virus and pBABE, and optionally modified with 
heterologous cis-acting elements. 

10 The retroviruses may include inducible constitutive promoters. 
7or example, there are situations wherein it is necessary to 
induce peptide expression only during certain phases of the 
selection process. For instance, a scheme to provide pro-inf lam- 
rr.atcry cytokines in certain instances must include induced 

15 exoression of tne peptides. This is because tinere is some expec- 
tation that over-expressed pro-inflammatory drugs might in the 
long-term be detrimental to cell growth. Accordingly, constitu- 
tive expression is undesirable, and the peptide in only turned 
on during char phase of the selection process when the pher.otype 

2C is required, and then the peptide is shut down by turning off 
the retroviral expression to confirm the effect or ensure long- 
term survival of the producer ceils. A large number of both 
inducible and constitutive promoters are known to the skilled 
person . 

25 In addition, it is possible to configure a retroviral vector to 
allow inducible expression of retroviral inserts after integra- 
tion of a single vector in target cells; importantly, the entire 
system is contained within the single retrovirus. Tet-inducible 
retroviruses have been designed incorporating the Self -Inacci- 

30 vating (SIN) feature of 3 ' LTR enhancer /promoter retroviral 

deletion mutant (Hoffmann et ai., PNAS U.S.A. 93:5185 (1996)). 
Expression of this vector in cells is virtually undetectable in 
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the cresence of tetracycline or other active analogues. However, 
in T.he absence of Tet, expression is turned on to maximum within 
48 hours after induction, with uniform increased expression of 
the whole population cf cells that harbour the inducible reiro- 
5 virus, indicating that expression is regulated uniformly within 
the infected cell population, a similar, reiatea system uses a 
mutated Ten DNA-oir.ding domain such that it bound DNA in the 
presence of Tet, and was removed in the absence of Tet . Either 
of these systems is suitable. 

10 In this manner the primers create a library of fragments, each 
containing a different random nucleotide sequence within a 
scaffold sequence derived from genetic material encoding a 
enzyme modulator. The ligation products are then transformed 
into bacteria, such as E . coli and DNA is prepared from the 

15 resulting library, as is generally outlined in Kitamura, PNAS 
U.S.A. 92: 9146-50 (1995), which is incorporated by reference 
herein . 



Delivery of the library DNA into a retroviral packaging system 
results in conversion to infectious virus. Suitable retroviral 
2C packaging system cell lines include, but are in no way limited 
to, PE5C1, Bosc23, ?si2, GP+E86, PhoenixEco, PA317, GP+AM12, 
DA(am.pho), Bing, FLYA13, ProPak, CRIP, PsiAM, Phoenix-Ampho, 
PG13, and H9 (293GPG) . 

In a preferred embodiment, the ceil lines discussed above, and 
25 the other methods for producing retrovirus, are useful for 

production of virus by transient t ra ns feet ion . The virus can be 
either used directly or used to infect another retroviral pro- 
ducer cell so as to expand the library. 

Concentration of virus may be done as follows: Generally, retro- 
30 viruses are titred oy applying ret rovirus-ccntaining supernatant 
onto indicator cells, such as NIH3T3 cells, and then measuring 
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the percer.tace of colls expressing pnenctypic consequences of 
■'r.fection. The concentration of the virus is determined by 
multiplying the percentage of cells infected by the dilution 
factor involved, and taking into account the number of target 
b cells available to obtain a relative titre. If the retrovirus 
contains a reoorter gene, such as 2acZ, then infection, integra- 
tion, and expression of the reconbinant virus is measured by 
histological staining for lacZ expression or by flow cytometry 
(FACfJ). In general, retroviral titres generated from even the 

10 best of the producer cells go not exceed 10' per mi, unless 
concentration is perforrnea on relatively expensive or exotic 
apparatus. However, it is believed that particles as large as 
retrovirus will not move very far by means of brownian motion in 
liquid, fluid dynamics predictions show that much of the virus 

15 never comes in contact with the cells in order to initiate 
infection. However, if cells are grown or placed on a porous 
filter surface and retrovirus are allowed to pass the cells by 
gradual gravi tomet r ic flow, a high concentration of virus around 
cells can be effectively maintained at all times. Thus, up to a 

20 ten-fold higher infectivity by infecting cells on a porous 

membrane and allowing retrovirus supernatant to flow past them 
has been seen. This should allow titres of 10 r after concentra- 
tion . 

•Jpon isolation/concentration of virus, the substantially identi- 
25 cal target cells are transduced by methods well-known in the 
art . 

As mentioned above, fusion of the expression product to at least 
one fusion partner which facilitates expression and/or purifica- 
tion/isolation and/or further stabilization of tne expression 
30 product is often desired. 

In eukaryotes, the fusion partner is often a sorting signal or a 
targeting sequence. Such a sorting signal will be in the form of 



MAI BKSKck'Crtl Screen? I>K. ;;PK;HK»-O7.20 



3 4 

a signal catch or a signal peptide. The well-known function of a 
sor - :n g sicrr.al is :o effect, export of an expresses peptide into 
endoolasmic reticulum, into Golgi apparatus, into lysosomes, 
into secretory vesicles, into mitochondria, into peroxisomes, or 
5 into the nucleus. Of course, also export to the membrane or out 
of the cell are possibilities. Preferably, the sorting signal or 
targeting sequence is selected from the group consisting of 

a nuclear localization signal (NLS) such as Prc-Lys-Lys- 
Lys-Arg-Lys-Vai {SV40 large T antigen NLS), Aia-Arg-Arg- 
10 Arg-Arg-Pro (human retinoic acid receptcr-3 NLS) , Giu-Glu- 

Vai-Gln-Arg-Lys-Arg-Gln-Lys-Leu (NF k3 p50) , Glu-Glu-Lys- 
Arg-Lys-Arg-Thr-Tyr-Glu (NFkB p65), and Ala-Va 1 -Lys-Arg- 
Pro-Aia-Ala-Thr-Lys-Lys-Ala-Gly-Gln-Ala-Lys-Lys-Lys-Lys- 
Leu-Asp (Xenopus nucleoplasmin NLS) ; 
15 - a membrane anchoring sequence such as those derived from 

CDS, ICAM-2, IL-8, CD4 , and LFA-1, and a lipidation se- 
quence such as a myr istylation or a palmitoylat ion se- 
quence ; 

a lysosomal sorting signal such as a lysosomal degradation 
2C seauence, and a lysosomal membrane sequence 

a mitochondrial localization sequence such as a mitochon- 
drial matrix sequence, a mitochondrial inner membrane 
sequence, a mitochondrial intermembrane space sequence, 
and a mitochondrial outer membrane sequence; an endoplas- 
25 mic reticulum localization sequence such as the sequence 

from calreticulin (KDEL) and the sequence from adenovirus 
Z3/19K protein (LULSRRSFIDEKKMP) ; 

a peroxisome sequence such as the peroxisome matrix se- 
auence from L-ucif erase; 

30 - a f arnesylation sequence such as LNPPDE3GPGCMSCKCVLS; 

a geranylgeranyiation sequence such as LTEPTQPTRNQCCSN ; 
a destruction sequence such as RTALGDIGN; and 
- a secretory signal sequence such as the secretory signals 

from IL-2, growth hormone, preproinsulin , and influenza HA 

35 Droteir.. 
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E nzyme rrodu.ators useful i n the irven: io:i 

The art has demonstrated the existence of numerous effective 
peptide enzyme activity modulators, '-'.specially the enzyme 
inhibitors are wel 1 -characterized. Non-limiting examples which 
5 are all incorporated by reference herein are listed in the 
foil owing : 

BPTI/Kuni tz family of protease Inhibitors : 

Pancreatic trypsin inhibitor [BPI!) from Bos vaurus; 
Spleen trypsin inhibitor from Bos taurus; 
10 Inter-alpha-trypsin inhibitor light chain (bikunin) from Bos 

taurus, Homo sapiens, Merlones ur.gu icula tus , Mesocricetus aura- 
tus, Mus musculus, Sus scrofa, Pieuronectes platessa , and Rattus 
norvegicus , respectively ; 

Ir.ter-aloha-trypsin inhibitor from Equus cabailus, Cvis aries, 
15 and Capra hircus, respectively; 

Henoiynoh trypsin inhibitor A from Manduca sexta; 

Hemolymph trypsin inhibitor B from Manduca sexta; 

Colostrum trypsin inhibitor from Bos taurus; 

Trypstatin from Rattus norvegicus ; 
2C Proteinase inhibitor from Tachypleus tndentatus; 

Serum basic protease inhibitor from Bos taurus; 

Chynotrypsin inhibitor SCI-III from Bombyx mori; 

Male accessory gland ser i ne-protease inhibitor from Drosophila 
funebris; 

25 Protease inhibitor 5 II from Anemonia sulcata; 

Chymotrypsin inhibitors SCI-I and SCT-II from Bombyx mori; 
Proteinase inhibitors SHPI and SHPI-2 from Stoichaczis helian- 
thus; 

Isoinhibitor K from Helix pomatia; 
30 Trypsin inhibitor IV from Radianthus macrodactylus; 
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Venom basic protease inhibitors IX and V.IIB fron Dungarus 
fasciatus; 

Venom basic protease inhibitors T and III from Vipera ammcdytes 
ammodytas; 

5 Venom basic protease inhibitor II from Daboia russelli siaman- 
sis, Hamachatus haemachatus , and Naja nivea, respectively; 
Venom basic protease inhibitors B and Z from Dendroaspis polyle- 
pi s polyl epl s ; 

Venorr. chyrr.ot r ypsin inhibitor from Naja naja; 
1C Venom basic protease inhibitors I and K from Dendroaspis polyle- 
pis polylepis ; 

Venom basic protease inhibitor K from Dendroaspis angusticeps 

Venom trypsin inhibitor from Eristocophis macmahoni and Naja 

naja, respectively; 
15 Protease inhibitor from Sarcophaga bullata; 

Tissue factor pathway inhibitor from Homo sapiens, Oryctolagvs 

cuniculus , and Rattus norvegicus r respectively; 

Tissue factor pathway inhibitor 2 from Home sapiens; 

Uterine plasrnin/t ryps i n inhibitor from Svs scrofa; 
20 Protease nexin II [fragment of Alzheimer's disease amyloid A4 

protein) from Homo sapiens, Mus musculus , rattus norvegicus., 

Macaca fascicular is and Macaca mulatta , respectively; 

Amyloid protein 2 from Homo sapiens and Rattus norvegicus, 

respectively; and 
25 Ornithodorin from Orn i thodoros moubata , 

as well as inhibitors homologous therewith isolated from other 

sources than those explicitly mentioned. 

Serpln family of protease Inhibitors: 

Alpha- 1 -proteinase inhibitors (alpha-l-antitrypsins) from Equus 
30 caballus , Mus musculus , Cavia porcellus , Oryctolagus cuniculus , 
Bos taurus, Chinchilla villidera , Didelphis marsurpiales 
virgin iana, Homo sapiens, Macropus eugenii, Mus caroli, Papio 
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ar.ubis, Sus scrcfa, Rattus r.c rve- 1 cu s , and Ovzs arias, respec- 
t i vely; 

Alpha- 1 -ar.tichymotrypsin frcT. Homo sapiens; 

AnULfironbin III ircm Ho^o sapiens, Bcs taurus, Mus musculus, 
5 Qvzs arias, Me soar icetus a u rat us, and Callus gall us, respec- 
tively; 

Alpha-2-plasmin inhibitor (alpha-2-ant iplasmin) from Bos taurus 
Homo sapiens , and Mus musculus , respectively; 
Z2 s~ *t* t i n ^p^.^r^^c^ [rhibit^r 10) fr or 1 . Homo saoi en s ; 
10 Ccntrapsin from Mus musculus and Cavia porcelius; 

Cont rapsin- I i ke oroiease inhibitors f :on Kattus norvegicus; 
Factor Xlla inhibitor from Bos taurus; 

Glia derived nexin (protease nexin I) from Homo sapiens, Mus 

musculus, and Rattus norvegicus, respectively; 
15 Heparin co-factor II from Homo sapiens, Mus musculus, Orycto la- 

gus cuniculus, and Rattus norvegicus, respectively; 

47 kDa heat shock protein (serine protease inhibitor J6) from 

Mus musculus and Gallus gallus r respectively; 

Cl-inhibitor from Homo sapiens; 
20 Leukocyte elastase inhibitor from Equus cabal lus, Homo sapiens, 

and Sus scrofa, respectively; 

Protein C inhibitor from Homo sapiens; 

Kallistatin from Homo sapiens; 

Kallikrein-binding protein from Mus musculus and Rattus norvegi 
25 cus f respectively ; 

Maspir. from Homo sapiens, Mus musculus, and Rattus norvegicus, 
respect ively; 

Plasminogen activator inhibitor-1 from Bos taurus, Homo sapiens 
Mus musculus, Mustela vison, and Rattus norvegicus, respective- 
30 ly; 

Plasminogen activator innibitor-2 from Homo sapiens, Mus muscu- 
lus, and Rattus norvegicus, respectively; 

Neuroserpin from Homo sapiens, Mus musculus , and Gallus gallus 
Cytoplasmic antiproteinases 1, 2 and 3 from Homo sapiens 
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Antitrypsin frcT Bombyx mori, respectively; 
Ant ichynot rypsins I and II from Bombyx men; 
Alaserpir. fron Manduca sexta; 

Serine protease inhibitor 2.1 front Rattus norvegicus; 
5 Serine proteinase mnibitor 1 from Ccwpox virus, Rabbit pox 
virus, Swine pox virus, Vaccinia virus, and Variola virus, 
respectively; 

Serine proteinase inhibitor 2 from Rabbit pox virus, Vaccinia 
virus, and Variola virus, respectively; 
10 Serine proteinase inhibitor 3 from Vaccinia virus and Variola 
virus, respectively; 
Serpin 1 from Myxoma virus; 

Serine proteinase inhibitor from Halccynthia roretzi; 
Ice inhibitor from Cowpox virus ( thiol-prot ease inhibitor); 
15 as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

Kazal family of protease xnhxhitors : 

Acrosin inhibitors from Bos taurus , Memo sapiens, Sus scrofa, 
and Macaca fascicular is , respectively; 
20 Elastase inhibitor from An em on i a sulcata; 
Ovomhibitor from Gallus gallus; 
Rhodniin from Rhodnius prolixus; 

Pancreatic secretory trypsin inhibitor from Rattus norvegicus, 
Anguilla anguilla, Bos taurus, Canis familiaris , Homo sapiens, 
25 Sus scrofa and Ovis aries, respectively; 

Pancreatic secretory trypsin inhibitor II from Rattus norvegi- 
cus; 

Double-headed protease inhibitor (submandibular gland) from 
Cams familiaris , Felis silvestris catus, Meles meles, Panthera 
30 leo, and Vulpes vulpes f respectively; 

Trypsin inhibitor from Halocynthia roretzi; 
Tryptase inhibitor from Hirudo medicmalis; 



MAE BrtHectvCctlScietnZ l>K ).PK-'t WS-0--:0 



39 



Prostatic se:retory glycoprotein from Mus musculus ; 
Ovornuco id (tr.ird dorr, a ir: ) f r err Aburrj a p if: i ie , Aepypodi us 
a r t'ak i anus , Afr Dpavo congons js, A lectori s chukar , Alec tor Is 
ru fa , An as pi a t y r hyn ch ds , Ch loephaga pier a , Cya n ochen c ya nop- 
b tera , Neochen jubata, Tadorna radjah, Lophonetta specu la r io ides , 
Ana s capons i s , Aix ga 1 erj cu lata , Aix sponsa , Sa rk idiom l s 
ir.elanotos , Alopochen aegypt iaca , Mezgus cucullatus , Anhinga 
n.ovaehol landiae, Anser anser anser, Anser mdicus, Anseranas 
semipa lmata , Arborophila torqueola, Arousianus arqus, Aythya 

1C amencana , Netta rufina, Balearica pavomna , Baw.bus icola 

thorazica , Bonasa umbel I us , Bran ta canadensis , Anser canagrcus , 
Call ipepla squamata castanogastnc, Cal 1 ipepla squamata pallida, 
Ca rpodacus mex icanus , Ca rpccoccyx rena uldi f Ca sua rius casuar ius , 
Casuari us ben net ti , Cereopsis novaehol landiae, Chauna chavaria , 

15 Chauna torquata, Gallus gallus, Chrysolophus amherstiae, 

Chrysolophus pictus, Circus aeruginosas. Col mus virginianus, 
Corpus albus , Corvus monedula , Coscoroba coscoroba , Coturnix 
delegorguei, Ccturnix coturnix japonica, Crossoptilon crossopti- 
ion , Cygnus atra tus , Cygnus oler , Cxyura jamaicensis , Oxyura 

20 vittata, Cyrtonyx mcntezumae , Dacelo novaeguineae , Dendrocygna 
a rborea f Dendrocygna a rcua ta , Dendrocygna aut umnal is , Dendro - 
cygna bicolor , Dendrocygna eytoni r Dendrocygna viduata, Dromaius 
novae -hoi la ndiae , Eudromia elegans , Francolinus afer coqui , 
Franco 1 inus erckel i± , Franco 1 inus francolinus , Francolinus 

2 5 pondicerianus , Fulica atra , Gallinula ch lor opus , Gallirallus 
australis , Gallus varius , Geococcyx calif ornianus f Grus 
carunculatus , Grus japonensis , Grus vipio , Anthropoides virgo, 
Guira guira , Gutter a pucherani , Gyps coprotheres , Polyboroides 
radiatus , Aquila audax f Mecrosyrtes rr.onachus , Hal iaeetus albi- 

30 cilia r Haiiastur Indus , Larus ridibundus , Larus mar inus , Vanel- 
lus spinosus / Leipoa ocellata , Lophura bulweri , Lophortyx 
californica , Lophortyx gambelii , Lophura igni ta , Lophura diardi , 
Lophura leucomelana , Megapodius f reycinet , Meleagris gallopavo, 
Agriochari s ocellata, Nothoprocta cinerascens , Nothoprocta 



cerdicaria, Num i da m*?leagris, Aery Ilium vulturinuni, Nycticorax 
nycti-~orax, Opi sthoccmus hoazm, Greoriyx pictus, Ortaiis ve tu- 
la, Pavo cristatus, Pavo nruticus, Penelcpe jacquacu, Penelope 
supercxl laris , Perdrix perdrix, Phasianus colchicus colchicus , 
5 Phasianus versicolor, Pha 1 ac rocorax sulcirostr is , Podargus 
strigoides , Polyplectron bioa Icaratvm , Polyplectron emphanvm , 
Polybcrus plancus, Pygoscelis adelie, Pha lacrocora x albivenzer, 
Rhea americana , Pterocnemia pennata , Rhynchotus rufescens , 
Rollulus roulroul , Scythrops novaehol lar.diae , Spheniscus hum- 

1G boldti, Struthio camalus, Syrmaticus mikadc, Syrmaticus reeves- 
21, Tinamus major, Turdus meruLa, Turnix sylvatica , Tympanuchus 
cuoido, Centrocercus urophasianus , Tragopan blythii, Tragopan 
caboti, Tragopan satyra f Tragopan temminckii, Lophophoros 
impej anus , Crossopti Ion auritum, Crossoptilon mantchur icum , 

15 Lophura edwardsi , Lophura nycthemera , Lophura swinhoei , Pucrasia 
macrolopha , Catreus wallichii , Syrmaticus ellioti, Syrmaticus 
hurv.ias, Syrmaticus soemme rrmgi i , lagopus leucurus , and Vultur 
qryphus , respectively, 

as well as inhibitors homologous therewith isolated f rox other 
20 sources than :nose explicitly mentioned. 



Soybean trypsin inhibitor (Kunltz) family of protease Inhibi- 
tors : 

Aspartic oroteinase inhibitor from Solanum tuberosum; 
Cathepsin D inhibitors from Solarium tuberosum; 
25 Wound-induced aspartate proteinase inhibitor from Solanum 
tuberosum; 

Chyrr.otrypsin inhibitor 3 precursor from Psophocarpus 
t e t ra gon ol ob u s ; 

Trypsin inhibitor from Adena.nthera pavonina ; 
30 Trypsin inhibitor from Prosopsis juli flora; 
Trypsin inhibitor from Erythrina caffra; 
Tryosm inhibitor from Erythrina latisslma; 
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Chymotrypsin inhibitor ircn Lryr.hrina vaziegata; 

Trypsin inhirir^rs Ih, IB, and 2 iro.r. ?5^Driocarpjs tatragonolo- 
bus ; 

T r vps in /chyme trypsin inhibitor t r err. Al ocas : a macrorrhiza ; 
5 Trypsin inhibitor from Albiz/.ia julibrissin; 
Trypsin inhibitor from A.cacia confusa; 
Trypsin inhibitors A, 3 and C from Glycine max; 
Trypsin inhibitors KTIlanci KTI2 from Glycine max; 
Tatex serine proteinase inhibitor from Carica papaya; 
10 Cysteine proteinase inhibitor PC?I 3.3 from Solarium tuberosum; 
Kumtz-type inhibitors I and 2 (PXT-i) from Solanum tuberosum; 

Potato inhibitor I family (protease inhibitors) : 

Trypsin/subt i lisin inhibitor frx Ajp.aranthus caudatus; 
Subtilism inhibitor from Momordica charantia; 
15 Wound-induced proteinase inhibitor I from Lycopers icon esculen- 
tum, Lycopersicon peruviar.um, and Solanum tuberosum, respec- 
tively; 

Subtilism inhibitors I and II from Phaseolus angulans; 

Proteinase inhibitor I from Solanum tuberosum; 
20 Chyx.otrypsin inhibitor ?A (CT-2A) from Hordeum vulgare; 

Chymotrypsin inhibitor 2B {CI-23) from Hordeum vulgare; 

Chymotrypsin inhibitor 1A (CI-1A) from Hordeum vulgare; 

Chymotrypsin inhibitor 13 (CI-1B) from Hordeum vulgare; 

Chymotrypsin inhibitor 1C (CI-1C) from Hordeum vulgare; 
25 Chymotrypsin inhibitor I, a, b and c subunits from Solanum 

t ube rosum ; 

Subtilisin inhibitor from Vicia faba; 

Ethylene-responsive proteinase inhibitor from Lycopersicon 
esculentum; 

30 Proteinase inhibitors I-A and 1-3 from Nicotiana tabacum; 
Eglin C from Hirudo medicinalis; 

Inhibitor of trypsin and Hageman factor from Cucurbita maxima; 



4 2 

Trypsin inhibitor MCI-3 from Momordica charantia ; ana 
Trvosin inhibitor 1 from Nicotiana sylvestris, 

as well as inhibitors homologous therewith isolated from other 
sojrces than those explicitly mentioned. 

5 Bowman-Birk family of protease inhibitors : 

3owman-birk type protease inhibitors from Arachis hypogaea, Coix 
lachryma-jobi, Phaseolus angularis, Glycine max, Triticum aesti- 
vu/n, Arachis hypogaea, Phaseolus vulgaris, Setaria italica, 
Dolichos axillaris, Lonchocarpus capassa , Oryza satlva, Hordeum 
10 vulgare, Medicago scuteliata, Phaseolus aureus, Phaseolus luna- 
tus, Vicia angusti folia , Vicia faba, and Vigna unguiculata f 
respectively; and 

Wound induced trypsin inhibitors from Medicago sativa and Zea 
mays, respectively, 
15 as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

Squash inhibitor family: 

Elasiase inhibitor from Momordica charantia; 

Trypsin inhibitors I, II, and III from Lagenaria laucantha; 
20 Trypsin inhibitor I from Citrulius vulgaris; 

Trypsin inhibitors I , III, and IV from Cucurbita maxima; 

Trypsin inhibitors I and II from Luffa cylindrica; 

Trypsin inhibitors I and II from Momordica charantia; 

Trypsin inhibitor I from Momordica repens; 
25 Trypsin inhibitor II from Bryonia dioica; 

Trypsin inhibitors 113 and IV from Cucumis sativus; 

Trypsin inhibitor II from Ecballium elaterium; 

Trypsin inhibitors I, II and III from Cucumis melo var. Conomon; 
Trypsin inhibitors II and III from Cucurbita pepo; and 
30 Trypsin inhibitor A from Momordica charantia, 
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as well as inhibitors hcrr.clcgous therewith isolated from o:her 
sources than tnose explicitly mentioned. 



Streptomyces subtilisin Inhibitor (SSI) family: 

Subtilisin inhibitor from St reptcwyces a 1 bogriseolus ; 

5 Subtilisin inhibitor-like protein-1 from Streptomyces cacaoi; 
Trypsin inhibitor ST 12 from Streptomyces 1 ongisporus; 

Subtilisin inhihitr.r-1 ike prote:r.-2 from S t reptomyces rochci; 

Subtilisin inhibitor-like protem-3 from Strepicmyces coeli- 
color; 

10 Subtilisin inhibitor-like protem-4 from Streptomyces laven- 
dulae; 

Subtilisin inhibitor-like protem-3 from St reptomyces virgimae; 

Protease inhibitor SIL-V3 from Streptoverticillium eurocidicus ; 
Protease inhibitor S I L- VI /S I L- V4 from Streptoverticillium 

1 5 f iavopersicus ; 

Protease inhibitor SIL-V5 from Streptoverti cillium luteoverti - 

c Hiatus; 

Protease inhibitor S1L-V2 from Streptoverticillium orinoci; 
Alkaline protease inhibitor LC 1 from St reptomyces gr lseoxncar- 

2 0 nanus; 

Protease inhibitor from Streptomyces lividans; and 
Plasminostreptin from Streptomyces antifibrinolyticus, 
as well as inhibitors homologous therewith isolated from otner 
sources than those expl ici t 1 y mentioned . 



2 5 Bombyx family of protease inhibitors 

Fungal protease inhibitor F (FPI-F) from Bombyx mori f as well as 
inhibitors homologous therewith isolated from other sources than 
Bombyx mori . 
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Wap-type ' Four- dl sulphide Core' Proteinase Inhibitors 

Antileukcprctsinase 1 frcrr. Homo sapiens and Mas musculus, re- 
spectively; 

Elafir. from Home sapiens ana Sus scrofa, respectively; and 
5 Chelonianin (basic protease inhibitor) from red sea turtle, 

as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

Hirudin family of protease Inhibitors 

Hirudins from. Hirudo medicinalis and Hirudmaria mam llensis, 
10 respectively, as well as inhibitors homologous therewith iso- 
lated from other sources than those explicitly mentioned. 

Factor Xa Inhibitors 

Antistasin from Haemanter ia officinalis , Haementeria ghilianxi 
Hirudo medicir.alis, Hirudo mpponia, and Hydra magnipapil lata , 
15 respectively, as well as inhibitors homologous therewith iso- 
latea from ether sources than those explicitly mentioned. 

Ascaris trypsin inhibitor family 

Chymotrypsm/elastase inhibitors from Ascaris suum and 
Trypsin inhibitor from Ascaris suum as well as inhibitors homo 
20 logous therewith isolated from other sources than Ascaris suum 

Cy statin family of protease inhibitors 

Leukocyte cysteine proteinase inhibitors 1 and 2 from Sus scro 
fa; 

Stefins 1, 2 and 3 from Mus musculus; 
25 Cystatins Al, A3, A8 and B from Sus scrofa; 
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Nystatins A anc 3 from Bos taurus; 
Sue f in C f rorr. Bos taurus; 
Cystatin 3 f r ox Qv i s aries ; 

Cysiatins A, C, D, M , SN, S and SA r. ro.Ti Homo sapiens; 
5 Nystatins 3 ana C from Mus rriusculus; 

Cys t a t i ns C and S from Rattus norvegicus; 

Cystatins C from Macaca mulatta and Saimiri sci ureas, respec- 
tively; 

Cystatin alpha (epidermal thiol proteinase inhibitor) from 
1C Rattus norvegicus; 

Cystatin 3 (liver thiol proteinase inhibitor) from Homo sapiens 
and Rattus norvegicus , respectively; 

Cystatin (colostrum thiol proteinase inhibitor) from Bos taurus 

Cystatins from Gallus gailus and Co tu mix coturnix j aponica , 
15 respectively; 

Cystatin from Cyprmus carpio; 

Cystatin from Bit is a ri entzas ; 

Cystatin I from Zea mays; 

Oryzacystatin-I from Oryza sativa; 
20 Cryzacystatin-II from Oryza sativa; 

Cystatin A from Hel ianthus annuus; 

Cystatin E from Hoi i an thus annuus; 

Cystatin from Vigna unguiculata; 

Onchocystatin from Onchocerca volvulus; 
25 Cysteine proteinase inhibitor from Solanum tuberosum ; 

Cystatin WCPI-3 from Wisteria floribunda; 

Cystatin from Glycine max; and 

Kininoger.s from Bos taurus r Homo sapiens and Rattus norvegicus , 
respectively, 

30 as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 
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Calpain family of cysteine protease Inhibitors 

Caipastatin fron Bos taurus, Cercopitnecus aethiops , Homo sapi- 
ens, Mus muscuius , Sus scrofa, Oryctoiaqus cur.iculus , Rattus 
no rvegicus , and Ovis arias, respectively, 
5 as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

Tissue inhibitor of metalloproteinases family 

Metalloproteinase inhibitor 1 from Bos taurus, Homo sapiens, Mus 
muscuius, Papio cynocapha lus , Sus scrofa, Oryctclagus cuniculus , 
10 Rattus norveg icus , and Ovis aries, respectively; 

Metalloproteinase inhibitor 2 from Bos taurus, Homo sapiens, Mus 
muscuius , Rattus norvegicus , and Gallus gallus, respectively; 
and 

Metalloproteinase inhibitor 3 from. Bos zaurus , Homo sapiens, Mus 
15 muscuius , Rattus norvegicus , and Gallus gallus, respectively, 
as well as inhibitors homologous therewith isolated fron other 
sources than those explicitly mentioned. 

Carboxypeptidase A inhibitors 

Ca rboxypept idase A inhibitor from Ascaris suum , as well as 
20 inhibitors homologous therewith isolated from other sources than 
Ascaris suum. 

Metallocarboxypeptidase inhibitors 

Metallocarboxypeptidase inhibitors from Lycopersicon esculentum 
and Solanum tuberosum, respectively, as well as inhibitors 
25 homologous therewith isolated from other sources than those 
explicitly mentioned. 
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Angiotensin- converting enzyme inhibi tors 

An g 1 o t e n s i n - c Ci n ve r 1 1 :i q e n z yrr.e innibiior f r on un n us a i ba ca res ; 
Anantens]r-.::onverting enzyme inhibitors from Both r ops insula - 
n s; 

5 Angi o ten sin -convert ing enzyrr.e inhibitors iron Both r ops jararaca 
An g j ^t ens i a- conve rt i ng enzyme inhibitor from Agkistrodon halys 
blomncffi ; 

Angiotensin -converting enzyme inhibitor from Agkistrodon halys 
pa lias; and 

1C Angiotensin -converting enzyme inhibitor from Viper a aspis, 

as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 



Pro tease inhibitors not belonging to a specific family 

Ecotin from Escherichia coli; 
15 Metalloproteinase inhibitor from Streptomyoes nigrescens; 

Proteinase inhibitor from Erwlnia ch rysantheml ; 

Proteinase inhibitor from Pseudcmonas aeruginosa; 

Protease A inhibitor 3 from Saccha romyces cerevi s iae; 

Protease B inhibitors 1 and 2 from Saccha romyces cerevisiae; 
20 Major pepsin inhibitor ?I-3 from Ascaris suum; 

Intracellular proteinase inhibitor f rom Bacillus subtilis; 

Proteinase inhibitor PTI from Scianum tuberosum ; 

Proteinase inhibitor PCI - I from Solanum tuberosum; 

Proteinase inhibitor IIA from Solanum tuberosum; 
25 Proteinase inhibitor IIB f rom Solanum tuberosum ; 

Proteinase inhibitors A and B from Sagittaria sagitti folia; 

Proteinase inhibitor from Solanum melongena ; 

Trypsin inhibitor from Brass ica napus; 

Trypsin inhibitor 2 from Sinapis alba; 
30 Trypsin inhibitor from Zea mays; 

Trypsin inhibitor from Sinapis a rvensis ; 
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Trypsin inhibitor from Trichosanthes kirilvwi 1; 

Wo'jnd-ir.ciuced proteinase inhibitor II fr or. lycopersicon esculen 
t urn ; 

Protease inhibitors LCMI I and ?M?-D2 fron Locusts migrator la; 
5 Protease inhibitor from Bacillus brevis; 

Marincstat ins C-l, C-2 and D from Aiteromonas sp.; and 
Host orotease inhibitor from, bacteriophage T4, 

as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

10 Cereal alpha- amylase /trypsin Inhibitor family 

Alpha amylase/trypsin inhibitor CM1 from Triticum aestivum; 

Alpha amylase/trypsin inhibitor CM2 from Triticum aestivum; 

Alpha amylase/trypsin inhibitor CM3 from Triticum aestivum; 

Alpha amylase/trypsin inhibitor CM 16 from Triticum aestivum; 
15 Alpha amylase/trypsin inhibitor CM17 from Triticum aestivum; 

Alpha amylase inhibitor 0.19 from Triticum aestivum; 

Alpha amylase inhibitor 0.28 from Triticum aestivum; 

Alpha amylase inhibitor 0.53 from Triticum aestivum; 

Alpha amylase inhibitor WDAI-3 from Triticum aestivum; 
20 Alpha amylase/trypsin inhibitor CMA from Hordeum vulgare; 

Alpha amylase/trypsin inhibitor CMB from Hordeum vulgare; 

Alpha amylase/trypsin inhibitor CMC from Hordeum vulgare; 

Aloha amylase/trypsin inhibitor CMD from Hordeum vulgare; 

Alpha amylase inhibitor CME from Hordeum vulgare; 
25 Alpha amylase inhibitor BMAI-1 from Hordeum vulgare; 

Alpha amylase inhibitor BDAI-1 from Hordeum vulgare; 

Alpha amylase/trypsin inhibitor from Eleusine coracana; and 

Trypsin/f actor Xlla inhibitor from Zea mays, 

as well as inhibitors homologous therewith isolated from other 
30 sources than those explicitly mentioned. 
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Alpha -amylase /trypsin Inhibitor homologous to thaumatin 

Alpha-any lase /t rypsin inhibitor from Zea mays as well as inhibi- 
tors hcmoloqous therewith isolated f r o f her sources than Zea 
ma ys . 

5 Alpha -amylase/ subtil is in inhibitor family 

Al pha -amy lase /sub ti 1 isin inhibitor from Hordeum vulgare ; 
Al pha-ax.y lase/ subti 1 is in inhibitor from Triticurn aestivum; and 
Al pha-amy lase/subt i 1 isin inhibitor from Oryzae sative, 
as well as inhibitors homologous therewith isolated from other 
10 sources than those explicitly mentioned. 



Inhibitors of insect alpha -amylases 

Small protein inhibitor of insec; alpha-amylases 1, 2 and 3 from 
Sorghum bicolor milo as well as inhibitors homologous therewith 
:soia;ed from orher sources than Sorghum bicolor milo. 



15 Inhibitors of mammalian alpha-amylases derived from Streptomyces 
species 

Alpha-amylase inhibitor HAIM I from Streptomyces gr iseosporeus ; 
Alpha-amylase inhioitor PAIM I from St reptomyc&s ol ivaceovi ri - 
dis; 

20 Alpha-amylase inhibitor HAIM II frorr. Streptomyces gr iseosporeus; 
Alpha-amylase inhibitor PAIM II from Streptomyces ol ivaceovi ri - 
dis; 

Alpha-amylase inhibitor AI-3688 from Streptomyces aureofaciens; 
Alpha-amylase inhibitor Z-2685 from Streptomyces rochei; and 
25 Alpha-amylase inhibitor HOE-467A from Streptomyces tendae , 

as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 
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Trehalase Inhibitors 

Trehalase inhibitor from Per j plane ta americana as well as 
inhibitors homologous therewith isolated from ether sources. 

Polygalacturonase inhibitors 

5 Polygalacturonase innibitors from Phaseolus vulgaris and Pyrus 
communis as well as inhibitors homologous Therewith isolated 
from ether sources than those explicitly mentioned. 

Fiicosyltransf erase inhibitors 

Fuctimns from Rattus norvegicus as well as innibitors homoio- 
10 gous therewith isolated from other sources. 

Protein Jtina.se C inhibitors 

14-3-3 Protein beta/alpha from Bcs taurus, Ovis aries, Homo 
sapiens , and Ratzus norvegicus, respectively; 
14-3-3 Protein epsilon from Mus musculus, Gvis dries. Homo 
15 sapiens , and Rattus norvegicus , respectively; 

14-3-3 Protein eta from Bos taurus r Mus musculus, and Rattus 
ncrveg icus , respectively; 

14-3-3 Protein gamma from Bos taurus, Ovis aries, and Rattus 

norvegi cus, respectively; 
2C 14-3-3 Protein zeta/delta from Bos taurus, Mus musculus , Ovis 

aries, Home sapiens, and Rattus norvegicus, respectively; 

Hint protein from Bos taurus, Rattus norvegicus, Homo sapiens, 

Mus musculus , and Oryctolagus cuniculus , respectively; and 

14 kDa zinc-binding protein from Brass ica juncea and Zea mays, 
25 respectively, 

as well as inhibitors homologous therewith isolated from other 

sources than those explicitly mentioned. 
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cAMP- dependent protein kinase inhibitors 

cAMP- dependent protein kinase inhibitor (muscle/brain form) 
f rex How d sapiens , Oryctolagus cun i cu 1 us , Mus mus cuius , ana 
Rattus norvegicus, respectively, and 
5 cAMP-depenaen t protein kinase inhibitor (testis form) from Mus 
mus cuius , and Rattus no rveg : cus , respectively, 

as well as .inhibitors homologous therewith isolated from other 
s o \: r c e ^ t h n those exolicitlv mentioned. 

Cyclic nucleotide phosphodiesterase inhihitor 

10 Cyclic nucleotide phosphodiesterase inhibitor from Di c tyos tel i urn 
dxscoideum as well as inhibitors homologous therewith isolated 
from other sources. 

Protein phosphatase inhibitors 

Protein phosphatase inhibitor 1 from Homo sapiens , Oryctolagus 
15 cunicuius, and Rattus norvegicus, respectively; 

Protein phosphatase inhibitor 2 from Homo sapiens, Oryctolagus 

cunioulus, and Rattus norvegicus, respectively; 

Heat-stable protein phosphatase 2a inhibitor I1PP2A from Bos 

taurus and Homo sapiens, respectively; and 
20 Phosphatase RA?A inhibitor from Bacillus subtilis, 

as well as inhibitors homologous therewith isolated from other 

sources than those explicitly mentioned. 

TCD/MRS6 family of GDP dissociation inhibitors 

Secretory pathway GDP dissociation inhibitor from Saccha romyces 
25 cerev is iae; 

Rab GDP dissociation inhibitor alpha from Bos taurus, Homo 
sapiens , Mus muse ul us , and Rattus norvegicus, respectively; 
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Rao GL^F cissociazion inhibitor beta from Homo sapiens, Mus 
muscuLus, and Katus ncrvegiciis, respectively; 

Rhc GD?-a:s3GCia:io:i inhibitor 1 from Eos taurus, Homo sapiens, 
Caenorhabdi t j s elegans ana Cavia porcoiJus, respectively; 
b Rhc GD?-di350Ciation mhioitor from Saccharo/nyces cerevisiae ; 
Rho GD?-dissocia:nn inhibitor 2 from Ho/no sapiens and Mus 
musoulus, respectively; and 

Rno GDP-dissociation inhibitor from Mus musculus, 
as well as inhibitors homologous therewith isolated from other 
10 sources than those explicitly mentioned. 

ATTase inhii>i tors 

Mitochondrial ATPase inhibitors from Bos taurus , Caenorhabditis 
elegans, Picnia jadinii, Sus scrofa, Rattus norvegicus, and 
Saccharomyces cerevis lae, respectively, and 
15 Sodium/potassiurr. ATPase inhibitor from Sus scrofa, 

as well as inhibitors homologous therewith isolated from, other 
sources than those explicitly mentioned. 

Phospholipas* A2 Inhibitory proteins 

Annexin I from Bos taurus, Homo sapiens, Mus musculus, Sus 
20 scrofa, Rattus norvegicus , Oryctolagus cuniculus, Cavia cutlari, 
Gallus gallus, and Columba livia, respectively; 
Annexin III from. Homo sapiens and Rattus norvegicus, respec- 
tively; 

Annexin V from Homo sapiens, Mus musculus, Rattus norvegicus , 
25 and Gallus gallus, respectively; 

Uteroglobulm from Oryctolagus cuniculus and Lepus capensis, 
respectively; and 

Phospholipase A2 inhibitor from Trimeresurus f lavovir idis , 
as well as inhibitors homologous therewith isolated from other 
30 sources than those explicitly mentioned. 
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Rihonuc lease inhibi tars 

Bars car iron Bacillus amyloliquefacieris; 

Riboruc lease inhibitor from Heme sapiens, Sus scrota, and Razz us 



3 t herewith isolated from other sources than those explicitly 
nent ior.ed . 

Bacterial RNA polymerase inhibitors from Bacteriophages T3 and 
T7 as well as inhibitors homologous therewith isolated from 
10 ether sources. 

DNA-entry nuclease izihibitors 

Competence protein J from Bacillus subtil is as well as inhibi- 
tors homologous therewith isolated from other sources. 

Beta -lactamase Inhibitors 

15 Beta-iactamase inhibitor from S t reptomyces clavul igerus as well 
as inhibitors homologous therewith isolated from other sources. 

To this list of enzyme inhibitors can be added functionally 
related inhibitors such as the inhibitor of calcium transport 
seminalplasmin from Bos taurus and Mus musculus , respectively, 
20 as well as inhibitors homologous therewith isolated from other 
sources - 

To the above list of suitable scaffold molecules to be used in 
the invention should be added other agents having an effect on 
the activity of enzymes. One interesting candidate is thioredox- 



norvegi cus , 



respectively, as well as inhibitors homologous 



25 in. 
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Ail of the above-listed scaffold molecules car. be substituted 
with an effective part of the complete molecule. 

P-onerties of the scaffold molecule 

Since the oeptiae library presumably can reach every compartment 
5 of a cell, it is beneficial if the scaffold enzyme inhibitor 
protein is not too large, and that it is stable towards e.g. 
proteolytic attack and insensitive to the reducing environment 
insiae eukaryotic cells. Hence, its function should preferably 
not be dependent on the formation of disulphide bridges, since 

10 these are not formed in the cytosol or nucleus of such cells. In 
addition the scaffold protein should contain one or more exposed 
loops in which peptides can be inserted without markedly chan- 
ging the structure or stability of the protein. On this basis 
the chymotrypsin inhibitor 2A (cf. the discussion below) is a 

15 suitable scaffold but ether scaffolds such as the examples 
mentioned below could also be used. 

The members of the peptide library introduced into target cells 
can effect change of the target ceils' phenotype either by 
direct or indirect ( t ransdomi nant ) interaction. However, the 
20 rrethod of the invention renders possible the identification cf 
not only the peptide or ribonucleic acid sequence which is 
responsible for the direct or indirect effect on phenotype, but 
also allows for the purification, isolation and identification 
of the target molecule with which the peptide interacts. 

25 In the present specification the barley-derived chymotrypsin 
inhibitor 2A has been used by way cf example, mainly because 
this protein is very well-characterized and extremely stable. It 
will be understood, however, that the use of this enzyme inhibi- 
tor in the Examples in no way should limit the scope of the 

30 present invention. 
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Barley chymotrypsin innibiL :r 2 A belongs to a large family of 
homologous protease inhibitors mainly ilound in plants. This 
family includes barley chymot r yps i n inhibitors 1A, IB, 2 A and 
23, potazo inhibitor I , wo u no -induced tixato inhibitor I, 
5 e thy 1 ene- respons i ve tomato inhibitor, wild tomato fruit inhibi- 
tor I, a subtil isin inhibitor frox. broad bean, adzuki bean 
subtilisin inhibitor, pumpkin trypsin /Hageman factor inhibitor, 
bitter gourd inhibitor, protoplast-specific trypsin inhibitor 
from Nicotiana sylvestris, tobacco subtil ism inhibitor, ama- 
1C ranth trypsin/subtilisin inhibitor, and beach canavalia suboili- 
sin inhibitor. The only member of this inhibitor family that is 
of non-plant origin is the leech e las tase /cat heps in G inhibitor 
eglin C. 

CI-2A was originally purified iron the endosperm of the high-ly- 

:5 sine barley Hiproly and shown to be a tight binding inhibitor of 
the microbial subtiiisins Carlsberg cna Novo as well as of 
chynotrypsin. The N-terminal amino acid residue in CI-2A has a 
blocked amino group as direct amino acid sequencing was unsuc- 
cessful. However, most of the amino acid sequence of CI-2A has 

2C been determined at the protein level. In addition, it was shown 
that CI-2A purified from oarley is N-termina 1 ly processed either 
during synthesis and storage in the endosperm or in the process 
of purification. The absence of the 17 N-terminal amino acid 
residues does not influence the complex formation of CI-2A with 

25 subtiiisins. Combining the results of amino acid sequencing and 
cDNA sequencing it has been deduced that CI-2A consists of S3 
amino acid residues in a single polypeptide chain containing no 
disulfide bonds. The blocked N-terninal amino acid residue is 
serine. The reactive site in CI-2A has been determined to be the 

30 Met59-Glu60 peptide bond and the residues in the region Ile56- 
Arg62 have been demonstrated to be involved in the intermolecu- 
lar contacts between inhibitor and protease. 




Trie t nree-d irrer.s 1 ona 1 structures of uncomplexed CI-2A as weil as 
of C1-2A complexed wi:n suotiiisin Novo are known from X-ray 
crystallography. The three-dimensional structure of CI-2A has 
also been determined using NMR spectroscopy, revealing that the 
o reactive loop of CI-2A is dynamic. CI-2A consists of a single a- 
helix docking against four (5-st rands . The surface loop stretches 
across the free side of the sheet and is composed of eight 
residues: Gly54-?yr61. In contrast to most enzyme inhibitors, 
CI-2A lacks disulfide bonds as well as glycosylat ion sites. In 
10 the structures determined, only the 64 C-terminai amino acid 
residues are defined (L20-G83) ; this truncated version retains 
the functionality of the native protein. Comparing the complexed 
form with the two uncomplexed forms of CI-2A reveals few diffe- 
rences. The most notable difference is that the reactive site 
15 loop seems to have a less ordered structure in the uncomplexed 
forms than in the complexed form. The t h r ee - dimens i ona 1 struc- 
ture of CI-2A in complex with subtilisin Novo has also been 
compared to the three-dimensional structure of eglin C in com- 
plex with subtilisin Carlsberg. The two homologous inhibitors 
20 have highly similar secondary and tertiary structures. 

Recombinant variants of N-terminaliy truncated CI-2A 

(CI-2A (L20-G83) ) and CT-2A with an N-terminal Asp-Pro extension 

have been widely used to study the folding and stability of 

CI-2A. The structure of CI-2A in complex with subtilisin Novo 
25 has revealed that the number of intermolecular contacts between 

inhibitor and protease in the P4-P1 region ( I le5 6-Met 5 9 ) of the 
inbiottor are much larger than in the Pl'-P3' region 
(Glu60-Arg62) . 

Further aspects of the invention 

30 Having identified a modulator according to the above-detailed 
methods of the invention it is normally of interest to provide 
large quantities thereof for the purposes of further research 
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and development:, including possible identification of the target 
rno lecuic wroi w 1 m. c h the .Toduiator pnysically interacts. 

Therefore, the invention also pertains to a method for the 
preparation, of a replicabLe expression vector, the method com- 
5 prising trie steps of identifying a modulator by the methods of 
the invention, ana subsequent 1 y 



15 Such methods are widely known in the field of genetic engineer- 
ing and molecular biology. The skilled person will find suitable 
guidance in e.g. Sambrook J , Fritsch EF, Maniatis T. 1989. 
"Molecular cloning: A laboratory manual", 2nd ed. Cold Spring 
Harbor Laboratory, Cold Spring Haroor, N.Y. 

20 In this part of the invention it is preferred that the promoter 
is inducible; constitutive promoters are not excluded though. 
Depending on the choice of host cell to carry and express the 
vector, the promoter is selected from the group consisting of a 
bacterial promoter, a fungal promoter such as a yeast promoter, 

25 and a mammalian promoter. The vector can be in the form of a 

plasmid, a phage, a cosmid, a minichromosone, or a virus, again 
depending on choice of host cell and other considerations apply- 
ing for the specific case. However, it is most preferred that 
the expression vector is capa'ol& of being integrated into the 

30 genome of a suitable host ceil, since the expression therein 
will then be more stable over time than is the case with non- 
chromosomal transformation of the host cell. 
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isolating or synthesizing a nucleic acid sequence which 



encodes the modulator, and 



10 



engineering a replicable expression vector comprising an 
ooeron which comprises, in the 3* -3' direction and in 
operable linkage, 1) a promoter for driving expression of 
the nucleic acid sequence, 2) optionally a nucleotide 
sequence encoding a leader peptide, 3) the nucleic acid 
sequence, and 4) a termination signal. 



58 

Weil -known vector systems are based on bacterial plasm.id p3R322, 
X-phage, and yeast plasmid YKp7 , but other suizable and feasible 
choices are known to the skilled person. 

After having provided a suitable expression vector as outlined 
5 above, it is preferred that the modulator is produced by trans- 
forming a suitable host cell with an expression vector prepared 
as described above. Such a nost cell can be bacterial (e.g. E . 
coli, Bacillus subtilis, or any other suitable bacterial host 
cell), a fungus (e.g. a yeast cell such as Saccha romyces 
10 cerevisiae cr Piccia pastoris) or a plant, insect or mammalian 
cell (which can be any of the above-discussed ceils or cell- 
types suitable for use as the "substantially identical cells" in 
the method of the invention) . 

After having provided a producer cell line as described above it 
15 is now possible to produce the modulator of the invention by 
growing the transformed ceil prepared as described above in a 
culture medium under conditions which facilitates expression by 
the cell of the randomly noaified nucleotide sequence, and 
subsequently harvesting the expression product from the cell 
20 and/or the culture medium. Alternatively, the modulator can be 
oroouced by synthesizing the modulator by means of chemical 
synthesis on the basis of the sequence determined in step (e) . 
In the case that the modulator is a peptide, the well-known 
techniques of solid- or liquid-phase peptide synthesis can be 
25 employed and also if the modulator is a ribonucleic acid, me- 
thods for synthetic production thereof are readily available. 

An important part of the invention pertains to the isola- 
tion/identification of the target bioir.olecules which are enga- 
ging the modulator, the identification, isolation and production 
30 of which is described above. Hence an important part of the 

invention pertains to a method for isolating a target biomoie- 
cule, the method comprising providing a modulator according to 
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-he neihcds described herein and subsequently using the modula- 
tor as an affinity I : gand in an affinity purification step so as 
t:> isolate the target biomoiecule from a suitable sampie. The 
affinity purification step can e.g. c e an affinity chrorr.ato- 
5 graphic step, an affinity Tass spectrometry step, or a co- 
immunoprecip 1 tat ion step. However, any suitable method for 
af L mity-cased isolation/purification can be employed. 

Alternatively, the modulator can be used as a probe against a 
cJNA library derived from the substantially identical cells or 
10 as bait in a two- or three-hybrid system (bacterial, fungal or 
mammalian) . 

It is preferred that the target biomoiecule is a peptide or a 
nucleic acid, since this also allows of sequence determination 
thereof. 

15 The potential use of such a target biomoiecule is to employ it 
in a drug-development program. For this purpose it is often 
useful to resolve the 3-dimensior.al structure of the target 
biomoiecule (by means of methods available, e.g. X-ray diffrac- 
tion studies, NMR analysis, etc) . 

20 Having isolated a target biomoiecule as described above, the 
invention allows for the rational selection of a chemical com- 
pound to be used as a lead compound in drug development, the 
method comprising the steps of 

assaying a library of chemical compounds for interaction 
25 with a target biomoiecule which has been de novo isolated 

according to the methods of the, and 
- selecting compounds which interact significantly with the 

target biomoiecule. 
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This approach can be refined by initially ideniifying the mem- 
bers of the library by methods of structure-based or non-struc- 
ture based conpu:er drug-modal 1 ing . Suitable non-structure based 
methods are disclosed in US 5,307,287 and US 5,025,388; a method 
known as CoMFA; . An alternative is HASL (Hypothetical Active 
Site Lattice; Hypothesis Software) . Both these methods are based 
on 3D-QSAR. A feasible structure-based approach is e.g. dis- 
closed in WO 95/06293. 



Finally, a very important part of the present invention pertains 
10 to a method for the preparation of a medicinal product, the 
method comprising tne steps of 

a) selecting a lead compound by the methods of the invention 
described above, 

b) performing pre-clinical tests with the lead compound in 
15 order to assess the suitability thereof as a medicinal 

oroduc t w 

c) entering, if the lead compound is deemea suitable in step 
(b), clinical trials using the lead compound in order to 
obtain market authorization for a medicinal product in- 

20 eluding the lead compound as a pharmaceutical ly active 

substance, and 

d) upon grant of a market authorization, admixing the lead 
comoound with a pna rmaceut ical ly acceptable carrier, 
excipient or diluent and marketing the thus obtained 

25 medicinal product. 



The above-outlined method should of course take into conside- 
ration all necessary requirements in order to meet GCP and GMP 
standards. At any rate, this method is completely depending on 
the initial provision of the modulator identified according to 
3C the invention. 
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LEGEND TC THE FIGURE 

Fig. 1: Functional expression of CT-2A in NIH3T3 cells. 
Subtilisin was incubated with increasing airicunts of the indi- 
cated ceil extracts and subsequently assayed for proteolytic 
5 activity. Facn reaction with a given extract concentration was 
determined in trioiicate and the shown velocities are based on 
the mean values . 

PREAMBLE TO EXAMPLES 

In the following, the present invention is illustrated by way of 
10 example wherein CI -2 A is used as starting point for the Cell- 
Screen technique adapted according to the invention so as to 
allow intracellular expression of a scaffold protein inhibitor 
which is randomly modified in the active site. This example is 
non-limiting, in the sense that other suitable protein inhibi- 
15 tors o: enzymes could oe used instead of CI -2A . The skilled 
person can readily perform the necessary substitutions and 
modi ficat ions necessary in order to apply the principles exem- 
pli fied below using other protein inhibitors of enzymes as 
starting point . 

20 EXAMPLE 1 

Expression of CI-2A truncate in NIH-3T3 cells 
Const ruct ion cf CMVbipep/CI-2A 

The CMVbipep/CI-2A construct was created by insertion of a CI-2A 
der i ved cDNA fragment that encodes amino acids 21-84 f rom the 
25 wild type CI-2A sequence (cf. the numbering in SEQ ID NO: 1) and 
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an N-terxir.ally located methionine residue into the BamHl and 
Xhol sites within the CMVbipep retroviral vector (a standard 
retroviral vector described in WO 96/38553, cf. Fig. 1 therein). 
The 5117 base pair BamttI / Xhol fragment cf CMVbipep was gel 
purified and extracted using a QIAquick gel extraction kit 
(Qiagen) according to the manufacturer's instructions. The 
fragment of the CI-2A gene was amplified from a plasmid contain- 
ing the entire CI-2A encoding DNA sequence by use of the primers 



5' -cgggatccatgaaGAcagagt.ggccagaG and 
10 5 ' -CGCTCGAGTCAGCCGACCCTGGGGACCT 

in a standard polymerase chain reaction (PCR) . The amplified DNA 
fragment was purified using the QUAquick PGR purification kit 
(Qiagen) according to the manufacturer's protocol and subse- 
quently digested with BamHI and Xhol and purified as described 
15 above. An approximately 10-fold molar excess of the CI-2A frag- 
ment was incubated together with the CMVbipep derived fragment 
under standard ligation conditions (New England biolabs) . Posi- 
tive clones were identified and confirmed by DNA sequencing. 



Transduction of Nj.H-373 cells 

20 CMVbipep and CMVbipepCI -2A were transfected into the BOSC pack- 
aging ceil line using a CaPO, co-precipitation method. BOSC 
packaging cells (also known as BOSC23 cells, cf . WO 94/19478) 
were diluted to 5 * 10 5 cells/cm 2 and incubated at standard cell 
culturing conditions in Duibecco' s modified eagle media (DMEM, 

25 GibcoBRL) including supplements (10% fetal calf serum, L-gluta- 
mir.e, 50 mg/ml penicillin/streptomycin (complete DM EM, Gibco- 
BRL) . To prepare the BOSC cells for transfection they were 
washed once in complete DMEM 2 hours before the transfection. 
The CaPO, co-precipitated mixtures were prepared by diluting 10 

30 mg CMVbipep/CI-2A or CMVbipep with 10 mg salmon sperm DNA in 450 
ml ddH : 0 and adding 50 ml 2.5 M CaCl ? . These solutions were 
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slowly aided to SOD nl 2 * HEPES -bu t i erea saline p*l 7.05 (230 mM 
i.'aCl, l.S mM Na ; HPCt, 50 mM KEPE5 /NaOH pH '/.Co) while gently 
shaking followed by two to five minutes of incubation at 25°C 
before adding tne precipitate tc the prepared 303C cells. After 
S 24 hours of incubation, the cells were washed twice in PBS (137 
mM NaCI, 2.7 mM KC1 , 8 . 3 mM Na 2 H?C\ 1 . 4 mM KH : P0_} and further 
cultivated in 10 ml DM EM Including supplements for another 24 
hours. The media from, tne transfected BOSC cells were collected 
and diluted from 10 to 10" fold in complete DM EM including G 

10 mg/mi Poiyoene. NIH3T3 cells that had been diluted into iO 4 

ceils/cm' ana incubated at standard cell culture conditions for 
24 hours in complete DMEM were incubated with the virus-contain- 
ing media for 24 hours at standard conditions. After this incu- 
bation period the transauced cell were washed twice in PBS and 

IS incubated in DMEM including supplements containing 0.6 mg/ml 
Geniticm (GibcoBRL) . 

Preparation of cell extracts 

The CMVbi?ep/CI-2A or CMVbipep transduced NIH3T3 cell were each 
harvested from two confluent TI75 cell cultjring bottles by 

20 incubation with 5 ml 0.5 * Trypsin-EDTA solution 

(GibcoBRL) /plate for 10 min at 37°C. The recovered cells were 
diluted 1:1 in complete DMEM and subsequently collected by 
cent r i f ugat ion, washed twice in 10ml DMEM and twice in 1 ml PBS. 
Finally, the cells were suspended 100 ml PBS. The cells were 

25 rendered permeable by three cycles of freezing in liquid nitro- 
gen and thawing by incubation at 37°C and subsequently centri- 
fuged at 20000 x g for 15 minutes cell to remove cell debris. To 
inactivate endogenous protease activity the extracts were incu- 
bated at 65°C for 15 minutes and r e-cent r i f uged . 

30 Subtilisin assay 
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The indicated cell extracts were dilutee to the indicated 
amounts in 10 nl PBS and added to 25 ~I C.l M Tris/HCl pH 8.6 
containing 5 x 13 _fi M Subtil: sin Ca rlsberg (Sigma). After incuba- 
tion at 25°C for 3C minutes, 25 ml C.l M Tris/HCl pH 8.6 con- 
taining 5 rr.M N-succinyl-Ala-Ala-Prc-Phe-p-Nitroanilide (Sigma) 
was added and the substrate conversion was followed oy measuring 
the absorbance at ^05 nm every two minutes until the reaction 
had reached exhaustion. 



Results/discussion 



10 A potential scaffolc for intracellular presentation of a peptide 
library expressed by virtue of. a retroviral vector should not 
have any significant effects on neither the retroviral replica- 
tion cycle nor the viability of mammalian cells. Furthermore, it 
must retain a stable structure. In order to test whether CI-2A 

15 fulfill these requirements CMVoipep/CI -2A was constructed con- 
sisting of the cDNA that encodes the CI-2A amino acids 21-84 
inserted into the CMVbipep retroviral vector. This CI-2A expres- 
sion construct was in parallel with CMVbipep transiently 
transfected into BCSC packaging cells and the produced viruses 

20 were transduced into NIH-3T3 cells. 

The titer of both CMVbipep and CMVbipep/C I -2A, determined as 
geneticin resistant colonies, were in both cases found to be 10- 
infectious virions/ml media . This demonstrated that the presence 
of the CI-2A gene within the viral genome did not affect the 
25 viral replication cycle to any detectable extent. 

To detect whether any expression of CI-2A occurs in the trans- 
duced cells tne potential of cell extracts obtained from the 
NIH3T3 cells transduced with either CMVbipep or CMVbipep/CI -2A 
to inhibit the proteolytic activity of subtilisin was deter- 
30 mined. This was done by incubation of various amounts of the 
cell extracts with subtilisin and subsequently detecting the 
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residual protease activity using a chromogenic substrate (cf. 
Fig. 1) . A complete block of subtiiisin activity was observed 
when t.ie reaction wis per for:r.ed in the presence of between 10 
and 0.2 ml cf the extract derived from tne CMVo i pep/ CI - 2 A trans - 
5 duced NIH3T3 cells whereas approximately 50 s of the subtil is in 
activity was maintained when C.l nl extract was present. If the 
amount cf CtMVb ipep /C I - 2 A extract was further decreased, the 
subtiiisin activity was net detectably reduced. The specificity 
of the CI-2A mediated inhibitory effect was demonstrated by the 
10 observation that even as much as 10 ml of extract from CMVbipep 
transduced control KIH3T3 cells had no effect on the subtiiisin 
activity. 

The data presented demonstrates that C1-2A does not affect the 
viral packaging and integration processes and that it can be 

15 expressed in detectable amounts from the CMVbipep vector. The 

ability of CI-2A to inhibit the proteolytic activity of subtiii- 
sin furthermore suggests that CI-2A adopts a functional struc- 
ture within mammalian cells. This is of great importance for the 
use of CI-2A as a scaffold since it then becomes possible to 

20 utilise the impressive amount of structural information avail- 
able on CI-2A to ensure that a peptide library is properly 
presented. The possibility of expanding the CMVbipep transduced 
NZH3T3 cell indicates that the presence of CI-2A does not affect 
to ceil viability, further supporting that CI-2A is a suitable 

25 scaffold for intracellular presentation of peptides. 

EXAMPLE 2 

Chymotrypsin inhibitor 2A (CI-2A) as scaffold in the CellScreen 
technology 

As demonstrated in Example 1, CI-2A can be expressed in a func- 
30 tional form in mammalian cells. The provision of a functional 
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system for displaying randomized peptide sequence using CI-2A as 
a scaffold is thus relatively uncomplicated to envisage. 

It has been shown that insertions of 1, 9, 11 and 13 residues 
between the Met40 and Glu41 (corresponding to the Met59 and 
h Giu6C in the native molecule) in CI-2A have a minimal effect on 
the stability and folding rates of the protein. The loop there- 
fore seems to be suitable for insertions of residues for the 
purposes of the present invention. Moreover, the truncated form 
of CI-2A has been found to fold by interactions of key residues 
10 in the - and C-terminal domains of the protein irrespective of 
whether the loop is cleaved. 

CI-2A is thus an extremely stable protein that has the advantage 
of being small, having no disulfide bridges, no glycosylat ion 
sites and best of all - a loop of eignt amino acids that can be 
15 manipulated without any major stability loss. Tc this should be 
added that CI-2A is of plant: origin, and therefore its use in 
prokaryotic and mammalian systems will not be obscured by the 
presence of highly homologous proteins. 

Especially the lack of aisulphiae bridges qualifies CI-2A as a 
20 scaffold for displaying diversity in the nucleus and/or cytosol- 
ic compartment of the ceil. As a scaffold, the 64 residue ver- 
sion of CI-2A could be used, since the N-termir.al 19 amino acid 
residues do not have any known function. Peptide libraries could 
then be inserted into the active-site loop or several amino acid 
25 residues in the loop could be substituted with a peptide li- 
brary. A peptide library inserted in the active-site loop of a 
protease inhibitor would very likely contain protease inhibitors 
of several new specificities but, as detailed above, also se- 
quences reactive with other biomolecules in vivo. 

30 By fusing the C-termmai of the truncated CI-2A to the surface 
protein of the bacteriophage pill, it is contemplated to use 
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CI -2 A scaffolds for se leer ion by phage display, which would make 
it possible to er. rich the library for binders zo intracellular 
components oefore expressing the library i n t r ace 1 1 u 1 a r 1 y in 
maccr.alian ceils. In order to direct the CI-2A scaffold to dif- 
5 ferenc ccx-pa r tmer.t s of the cell retroviral vectors holding 
different leader sequences will be constructed, cf . the above 
discussion . 

To be able to isolate the CI-2A scaffold - and thereby the 
target molecule to which it binds - from selected cells, a 

10 peotice tag will be engineered to the N- terminus of the trun- 
cated CI-2A. We are currently considering the following tags: 
his-tag, Strep-tag or FLAG-tag. however, for future use we are 
planning to use antibodies with specificity to CI-2A or to a 
peptide sequence of CI-2A. Peptide libraries of different sizes 

15 (six- to nine-rr.ers) would either be inserted between Met40 and 
Glu4 1 or amino acid residues in the loop wculd be substituted 
with a peptide library. 
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SEQUENCE LISTING 



C E N E R A L I N FO P>' AT I ON : 

AFPLICANC: 

{A: NAKE: M&E 3iotec:i A/5 

{3; STFEET: Kogle AUe 6 

;C: CIT V : Htzrsnclm 

iE) COUNTRY: jenmark 

(F; POSTAL CODE (ZIP): DK-2970 

(Gi TELEPHONE: +4 5 4 5 16 2 5 2 5 

■;:-:> TELEFAX: +45 45 16 25 CC 

TITLE Cf INVENTION: Novel method :cr tr.e identification cf iigano 
anc target oicmol ecjles 



in:: N *J M 3 E. R OF SEQUENCES: 2 



... fiv; COMPUTER READABLE FORM: 

1, [A) MEOIUK TYPE: Floppy disk 

- (b; COMPUTER: IBM PC ccmp^ihic 

' ; ; ,;c: CREMATING SYSTEM: PC- DCS / MS-DOS 

-1? jD} SOFTWARE : Patently Release *1.0, Version #1.30 (EPG) 

■« 

i?) INFORMATION FOR 5EQ ID NC : 1: 

• U) SEQUENCE CHARACTERISTICS: 

\. ;a) LENGTH: 451 base pairs 

[B! TYPE: nuclei: acid 
7* ( C ) ST RAN DEDi\ T ESS : double 

7 ;D) TOPOLOGY: linear 

.7 (u! MOLECULE TYPE: cCNA 

.;-■/:} O R I G I >: AL SOU RC E : 

(A) ORGANISM: Hordeum vuigare 
fix; FEATURE: 

;aj na>;e/:<ey: ccs 
(b) location: 35. .3 36 



;ix» FEAT C RE : 

v A • NAME/KEY: mat_ peptide 
(B) LOCATION: 38. . 336 



(ix;. FEAT L RE : 

( A } NAME/ KEY : Duplication 
(S) LOCATION: 24 9 

(D) OT-iER INFORMATION: C- is G or T 



■;ix} FEATURE: 

;A) NAME /KEY : Duplication 
(B) LOCATION: 2 52 
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A or C 



( 'J ) 07 fi E R IN TOPMAT I C N : A 

NAME /.KEY : D jpl i 
LOCATION: 2 79 

CI HER INEOPMATITN : 0 is C or T 

(xi) sequence description: seq id vo : 1: 
cattaaactg atgatatgac agttcaagat ctcacagtca ca1 cg ggg at ctaa.tcagtc 
teacaggaag cgagcg~aag aa:g atg agt tca gtg gag aag aag ctc gag 

Met Ser Ser Val Glu Lys Lys Pro Glu 



60 



ega GTl> aac ago g 



;r GAG CGT 



^AC CTG AAG AC A GAG TGG 



Uy Val Asr. Thr Gly Ala Gly Asp Arg His Asn Lea Lys _hr Glu Trp 
i C 15 2 0 



OCA GAG tt: GTG GGG 



GAG 



AG GCC AA Z 



;TG ATT CTG 



Pro Glu Leu Val GLy Lys Ser Vdl Glu Glu Ala Lys Lys Val l_e Leu 



33 



CCA GAG CCG CAA ATC ATA GT 



GAG GAG 
Gin Asp Ly.s Pro Glu hi a Clr 
45 



:tg co: 



40 



ZGG AC A AT 7 



lie Va] Leu Pr: Val GLy Thr ,.ie 



5C 



55 



^: 1 -o Ml 



Val Thr 



OG ATG GAA TAT CGG ATC GAG GGG GTC GGG CCG TTT GT1 



AAA 



■Cer. Glu 

60 



Tyr Arg _le Asp Arg Val Arg Leu to 



CTC GAG .AAG 
Leu Asp As:) 
7 5 



GCC 



GTC GGG AGG GTC GGC 



e v a i -H3o 



:CAAOCT? GAGAGCTAGC 



Ala Glr. Val Pro Arg Val Cly 
SO 



CTGCTGCTGG GGTGTATGTA TTCCAGCTTC ACCATCTGTT CTTGGCTATA GCAAGAT7GA 

G A TTT AT AAA T CAT A T AC AA TA-AGAGTTGC TGCGG 

(2; INFORMATION EOR SEQ 10 NO: 2: 

;i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 4 arrir.o acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Li) MOLECULE TYPE: protein 

(xi « SEQUENCE DESCRIPTION: SEC I D NO: 2: 

Me^: Ser Ser Val Glu Lys Lys Pro Glu Gly Val Asr. Thr Sly Ala Gly 
-11 5 10 15 

A.sp Arg His A. sr. Leu Lys Thr Glu Trp Pro Glu Leu Val Gly Lys Ser 
2C 25 30 



237 



3C3 
356 

4 16 
451 
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Val Glu Glu Ala Lys Lys Vol II; 

35 



) : C 1 .-i Asp Lys Pro Glu A 1 =j G 1 r 



lie lie Va_ Lej Fro Vai Gly Thr He Val Thr Met Glu Tyr Arg lie 
50 5 5 60 



Asp Arq Val Ar^ 
65 



?he Val Asp J.ys Leu Asp Asr. He Ala G_r. Val 
73 75 



Pro Arr Val Gly 
3C 



CLAIMS 

1. A method for idcr;n:y inq a:: 1:: vivo ac;;ve modulator of 
activity of a target enzyme, the method comprising the steps of 

(a) :.)repa:ing a pool of expression vectors, each vector of 
5 said pool containing at least one member from a library of 

randomly modified nucleotide sequences derived iron a 
parent nucleotide sequence encoding a parent peptide or 
parent ribonucleic acid which modulates the target enzyme 
activity, 

10 (b) transforming a population of substantially identical cells 
with said vectors of said pool so as to obtain transformed 
cells, said substantially identical cells being ones which 
harbour the target enzyme, 

(c) culturinq saia transformed cells under conditions 

15 facilitating expression of saia randomly modified nucleo- 

tide sequences, 

(d) examining said transformed cells and isolating transformed 
cell(s) wherein the activity of the target enzyme is 
modulated, and 

20 (e) determining said randomly modified nucleotide sequence of 
said vector present in cell (s) isolated in step (d) and/or 
determining the amino acid sequence or the ribonucleic 
acid sequence of the expression product encoded by said 
randomly modified nucleotide sequence. 

25 2. The method according to claim 1, wherein the randomly modi- 
fied nucleotide sequences consist of 1) an invariable part of 
the parent nucleotide sequence, and 2) random nucleotides. 
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3. The method according to claim 2, wherein the invariable part 
of the parent nucleotide sequence encooes a scaffold portion of 
the parent peptide or of the parent ribonucleic acid which 
serves to stabilize said polypeptide fragment or ribonucleic 
aci d fragment . 



4. A method for identifying a modulator in the form of a biolo- 
gically active oolypeptide fragment or ribonucleic acid fragment 
which is capable of detectably modulating, in vivo, a phenotypic 
trait in a ceil, the method comprising the steps of 



10 (a) preparing a pool of expression vectors, each vector of 

said pool containing at least one member from a library of 
randomly modified nucleotide sequences derived from a 
parent nucleotide sequence encoding a parent peptide or 
parent ribonucleic acid which in vivo modulates activity 

15 of a known enzyme, wherein the randomly modified nucleo- 

tide sequences comprise 

- an invariable part encoding a scaffold portion of 
the parent peptide or of the parent ribonucleic 
acid, said scaffold portion serving to stabilize 

20 said polypeptide fragment or ribonucleic acid frag- 

ment , and 

- random nucleotides, 



(b) transforming a population of substantially identical cells 
with said vectors of said pool so as to obtain transformed 

25 cells, 

(c) culturmg said transformed cells under conditions 
facilitating expression of said randomly modified nucleo- 
tide sequences, 



(d) examining said transformed cells and isolating transformed 
30 cell (s) wherein the preselected phenotypic trait is modu- 
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lated by the presence of the expressed randomly modified 
nucleotide sequence, and 

(e) determining said randomly modified nucleotide sequence of 
said vector present in ceil(s) isolated in step (d) and/or 
determining the amino acid sequence or the ribonucleic 
acid sequence of the expression product encoded by said 
randomly modified nucleotide sequence. 

5. The method according to any of the preceding claims wherein 
the substantially identical cells are prokaryotic cells. 

6. The method according to claim 5, wherein the prokaryotic 
ceils are bacteria selected from the group consisting of Bacil- 
lus sop., Clostridium spp., Corynebacter lum sop., Staphylococcus 
spp., Streptococcus sop., Escherichia coil, Serratia marcescens, 
Klebsiella spp., Proteus spp . , Curooacter spp., Salmonella 
spp., Shigella spp., Pseudomonas spp., Aci ne tobacter spp., 
Aeromonas spp., Plesiox.onas spp., Yersinia spp., Prancisella 
tuiarensis, Vibrio spp., Campylobacter spp., Helicobacter pylo- 
ri, Haemophilus spp., Bordetella spp., Brucella spp., Neisseria 
spp., Treponema pallidum, Leptospira interrogans; Borrelia spp., 
Legionella pneumophila, Listeria monocytogenes, Mycobacterium 
spp., Treponema pallidum, Chlamydia trachomatis, Actinomyces 
spp., Rickettsia spp., .Mycoplasma spp., Lactoccccus spp., and 
Lactobacillus spp. 

7. The method according to claim 6, wherein 

the Clostridium spp. is selected from the group consisting of C. 
botulinum, C. difficile, C. perfringens, and C. tetani 
the Corynebacter ium spp. is selected from the group consisting 
of C. diphtheriae, and C. pyogenes; 

the Bacillus spp. is selected from the group consisting of B. 
anthracis, B. subtiiis and B. cereus; 
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the Staphylococcus spp . is selected from the group consisting of 
S. aureus and 3 . albicans; 

the Streoiococcus spp. is selected from the croup consisting of 
S. pneumoniae, S. pyogenes, and S. agalactiae; 
5 the Klebsiella spp. is K . pneumoniae; 
the Proteus spp. is P. mirabilis; 

the Citrobacter spp. is Citrobacter freuncii;the Salmonella spp. 
is selected from the group consisting of S. typhi, S. typhimuri- 
um, S. shottmulleri and S. paratyphi; 
10 the Shigella spp. is selected from the group consisting of S. 
dystenteriae, S. flexneri, S. boydii, and S. sonnei; 
the Pseudomonas spp. is selected from the group consisting of P. 
aeruginosa, P. pseudoma i lei , and P. mallei; 

the Yersinia spp. is selected from the group consisting of Y. 
15 pestis, Y. enterocolitica , and Y. pseudotuberculosis; 

the Vibrio spp. is selected from the group consisting of V. 
cholerae and V. pa rahaemol yt icus ; 

the Campylobacter spp. is selected from the group consisting of 
C. jejuni and C. coli; 
2C the Haemophilus spp. is selected from the group consisting of H. 
influenzea, H. parainf luenzae, and H. aegyptius; 

the Bordetella spp. is selected from the group consisting of B. 
pertussis, B. parapertussis, and B. bronchiseptica; 
the Neisseria spp. is selected from the group consisting of N. 
25 gonnorhoeae and N. meningitidis; 

the Borrelia spp. is selected from the group consisting of B. 
burgdorferi sensu stricto, B. garinii, 3. afzelii, and 3. 
recurrent is ; 

the Mycobacterium spp. is selected from the group consisting of 
30 M. tuberculosis, M. bovis, M . africanum, M. kansasii, and M. 
leprae; and 

the Mycoplasma spp. is M. pneumoniae. 

8. The method according to any one of claims 1-4, wherein the 
35 substantially identical cells are eukaryotic cells. 
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3. The method according to claim S , wherein the eukaryjtic ceils 
are selected frcrr the cr:»up consisting of fungal ceils, proto- 
zoan ceils, rr.a 1 c e lis, and p 1 a n 7. z e 1 1 s . 

10. The met nod according to claim 9, wherein tine fingai cells 
5 are selected frcrr. the group consisting of Epide rmophyt on spp., 

Trichoohyton soo. , Mi crosoor-jn sod. , Candida albicans, Philo- 
phora spp., Co c z i d i o i de s immitis, Histoplasma capsulatum, 
Blastomyces dermat it idis, Paracoccidioides brasilier.sis, 
Cryptoccccus necformans, Aspergillus spp., Sac rharonryces cerevi- 
10 s i a e , a 1 / ver: m y t e 7- 1 a c t i s , and Piccia pasroris. 

11. The method according to claim 9, wherein tne protozoan cells 
are s c lected f r on the group consisting of Giardia 1 arr.b 1 i a , 
Trichomonas vaginalis, Di entamoeba fraqilis, Trypanosoma spp., 
Leishmania spp., Entamoeba histolytica, Naegleria fowleri, 

15 Acanc namceba castellar.i, Harnanelia spp., Iscspcra belli, 
Cryptosporidium spp., Sarcccystis spp., Toxoplasma gondii, 
Plasmodium sop., 3abesia spp., and Balanrcidium coii. 

12. The method according to claim 9, wherein the plant cells are 
derived from a plant selected from the group consisting of 

20 Nicotiana tabacum (tobacco plant), Arabidopsis th.aliana, Bras- 
sica napus, Brassica juncea, Musa sp. (banana plant), rice, and 
corn . 

13. The method according to claim 9, wherein the animal ceils 
are selected from the group consisting of .mammalian cells, 

2 5 arthropod cells such as insect cells, avian cells, and piscine 
eel 1 s . 

14. A method according to claim 13, wherein the mammalian cells 
are selected from the group consisting of Chinese hamster ovary 
(CHO) cells, VERO cells, HeLa cells, W138 cells, BHK cells, 

30 COS-7 293 cells, and MDCK cells. 



15. The method according :o any cf the preceding claims, wherein 
the transformed cells examined in step (d) predominantly carries 
one single copy cf tr.e vector. 

16. The method according to claim 15, wherein transformation 
5 step (o) is performed under such conditions that the ceils 

transformed are predominantly or at most transformed with one 
single vector from, said pool, or wherein, prior to carrying out 
step (d) , cells being transformed with more than one vector from 
said pool are substantially excluded from the further steps. 

10 17. The method according to any of the preceding claims, wherein 
the modulator is a peptide. 

18. The method according to any of claims 1-16, wherein the 
modulator is a nucleic acid fragment such as an RNA fragment. 

19. The method according to any cf the preceding claims, wherein 
15 the modulator is stable towards proteolytic attack and/or is 

insensitive to a reducing environment. 

20. The method according to any one of claims 2-19, wherein the 
random nucleotides are introduced in part (s) of the parent 
nucleotide sequence which encode (s) the active site(s) of the 

20 parent peptide or parent ribonucleic acid, or the part(s) which 
encode (s) structure (s) interfering with the active site(s) . 

21. The method according to any one of claims 2-20, wherein the 
invariable part of the nucleotide sequence encodes truncated 
parts of the parent peptide or parent ribonucleic acid suffi- 

25 cient to maintain stability. 

22. The method according to any of claims 2-21, wherein the 
invariable part of the parent nucleotide sequence encodes a 
peptide which is free from di sulphide bridges. 
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23. The method accoramg :o any one of claims 2-21, wherein the 
■r. van able part, of the parent: nucleotide sequence encodes a 
peptide having disuiphide bridges. 



: J 4. The method according to any one of ciains 2 -23, wherein the 
5 random nucleotides are introduced in the form of an insertion or 
a substitution into the parent nucleotide sequence, optionally 
in combination with deletion (s) in the parent nucleotide se- 
quence . 

25. The method according to claim 24, wherein the number of 
10 random nucleotides which are introduced is in the range from 3 
to about 100 . 



26. The method according to any one of claims 2-25, wherein the 
random nucleotides are nucleotide sequences and/or are single 
random nucleotides introduced at specific sites ^n the parent 

15 nucleotide sequence. 

27. The method according to any one of claims 2-26, wherein the 
random nucleotides are selected from the group consisting of 

synthetic, completely random deoxyr lbonucleot ides ; 



synthetic random DNA sequences, wherein limitation on 
20 randomization of some nucleotides is introduced so as to 

limit the number of available sequences and/or to avoid 
undesired stop codons and/or to facilitate introduction of 
post-t rans la t iona 1 modifications of expressed peptide(s); 



synthetic random DNA sequences as in (1) or (2) coupled to 
25 a sequence encoding a purification tag; and 

CDR encoding nucleotide sequences isolated from a library 
of immune-competent cells raised against an antigen. 
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23. The neihoci according :c claim 27, wherein the CDR encoding 
nucleotide sequences encode CDR-3 peptide sequences. 

29. The men hod according to any one of claims 26-28, wherein the 
randon nucleotides are prepared by codon split synthesis where 

5 defined DMA codons are synthesized in a random order. 

30. The method according to claim 29, wherein the relative 
amount of synthesized codons ensure that all encoded amino acids 
will be present with substantially the same frequency in encoded 
polypeptide fragments . 

10 31. The method according to any one of claims 2-30, wherein the 
random nucleotides are introduced into the expression vector by 
ihe principle of site directed PCR-mediated mutagenesis. 

32. The method according to any one of the preceding claims, 
wnerein tne modulator ir. vivo reduces KM of the target enzyme 

15 for at least one substrate. 

33. The methoc according to any one of claims 1-31, wherein the 
modulator in vivo increases KM of the target enzyme for at least 
one substrate . 

34. The method according to any of the preceding claims, wherein 
20 the modulator in vivo increases Vmax of the target enzyme for at 

least one substrate. 

35. The method according to any of claims 1-33, wherein the 
modulator in vivo reduces Vrr.ax of the target enzyme for at least 
one subs t rate . 

25 36. The method according to any one of the preceding claims, 
wherein the parent peptide or parent ribonucleic acid is an 
inhibitor of activity of the target enzyme. 
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37. A method according to claim 36, wherein the inhibitor is 
seLec:ed trom the group consisting of 

a B?TI/Kuni:z family protease inhibitor, a serpin family pr ore- 
ase inhibitor, a Kaza 1 t arm 1 y protease inhibitor, a soybean 
5 trypsin inhibitor (Kunitz) family protease inhibitor, a potato 
inhibitor I family member, a Bowman-Birk family protease inhibi- 
tor, a squash inhibitor family member, a wap-type ' Fcjr-disul- 
fide Core 1 proteinase inhibitor, a hirudin family protease 
inhibitor, a factor Xa inhibitor, an Ascaris trypsin inhibitor 
10 family member, a cystatin family protease inhibitor, a calpain 
family cysteine protease inhibitor, a tissue inhibitor of 
metalloproteinasos family member, a ca r boxypept Ida se A inhibi- 
tor, a metallocarooxypeptidase inhibitor, an angiotensin-con- 
verting enzyme inhibitor, a cereal alpha-anyl ase/t ryps in inhibi- 
15 tor family member, an a lpna - amy 1 a se / t r yos i n inhibitor homologous 
to thaumatin, an alpha-arr.ylase/subtilisin inhibitor family 
member, an inhibitors of insect a lpha-amyiases , an inhibitor of 
mammalian alpha- amylases derived from Streptomyces species, a 
trehaiase inhibitor, a polygalacturonase inhibitor, a 
20 fucosyltransf erase inhibitor, a protein kinase C inhibitor, 

an cAMP-dependent protein kinase inhibitor, a cyclic nucleotide 
phosphodiesterase inhibitor, a protein phosphatase inhibitor, a 
TCD/MRS6 family GDP dissociation inhibitor, an ATPase inhibitor, 
a phospholipase A2 inhibitory protein, a ribonuciease inhibitor, 
25 an RNA polymerase inhibitor, a DNA-entry nuclease inhibitor, and 
a beta -lactamase inhibitor. 

38. The method according to any of the preceding claims wherein 
the substantially identical cells are mammalian cells and the 
vector is selected from the group consisting of a retroviral 
30 vector, a vaccinia virus vector, an adenoviral vector, an adeno 
associated virus (AAV) vector, a herpes simplex virus (HSV) 
vector, an alpha virus vector, and a semliki forest virus vec- 
tor . 



Mil- B^tec^C'cllSciwn: DK 1 .1»K.M99S-07-:0 




ec 

39. Trie .T.ethod according to claim 38, wherein the vector is 
retroviral . 

40. The method according to ciain 39, wherein the retroviral 
vect:>r is derived from retrovirus selected from the group con- 



5 sistir.g of Avian Leukosis-Sarcoma Virus (ASLV) , Maramalian type 
C, Mammalian type B, and Lentivirus, and optionally modified 
with heterologous cis-acting elements. 

41. The method according to claim 39 or 40, wherein the retrovi- 
ral vector has non- identical ends. 

10 42. The T.ethod according tc claim 41, wherein the non-identical 
ends contain non-identical promoters. 

43. The method according to any one of claims 39-42, wherein the 
retroviral vector contains heterologous promoter replacing the 
viral promoter in the 5 ! -LTR, such as a CKV promoter, an RSV 

15 promoter, an SV-4C promoter, a TK promoter, an MT promoter, or 
ar. inducible system such as Tet or Ecdysone. 

44. The method according to any one of claims 39-43, wherein 
step (a) is carried out by 

1) :ransfec:mg suitable packaging cells with vectors com- 
20 prising the randomly modified nucleotide sequences and 

which are integ rat able in virions produced by said pack- 
aging cells, 

2) culturing said transfected packaging cells in a culture 
medium under conditions which facilitates production by 

25 the packaging cells of virions containing the randomly 

modified nucleotide sequences, 

3) recovering and optionally concentrating said virions, and 
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4) transducing said substantially identical cells with the 

vi rions . 

45. The nerhor: according to clam 44, wherein the packaging 
cells are selected from the group consisting of PE531, Bosc23, 

5 Psi2, GP^ESG, ?n:enixEco, PA317, GP-AM12, DA(ampho), 3ing, 
FL YA1 3 , ?roPa'<, CRIP, PsiAM, Pnoemx-Ampho, ?G13, and H9 
( 2 9 3G?G) - 

46. The methoa according to any of the preceding claims wherein 
the randomly modified nucleotide sequences are coupled to a 

10 nucleotide sequence encoding at least one fusion partner. 

47. The rr.ethod according to claim 4 6, wherein the fusion partner 
serves to facilitate expression and/or purification/isolation 
and/ or further stabilization of the expression product. 

48. The method according tc claim 47, wherein the fusion partner 
15 incluaes a purification tag such as His6 tag, myc tag, BSP 

biot rnyla t ion target sequence, of BirA, flu tag, iacZ, and GST. 

49. The method according to claim 46 or 47, wherein the fusion 
partner is a sorting signal or a targeting sequence. 

50. The method according to claim 45, wherein the sorting signal 
20 is a signal patch or a signal peptide. 

51. The method according to claim 49 or 50, wherein the sorting 
signal effects export of the expressed peptide out of the cell 
or into the cell membrane, or, when the substantially identical 
cells are eukaryotic, into endoplasmic reticulum, into Golgi 

25 apparatus, into lysosomes, into secretory vesicles, into mito- 
chondria, into peroxisomes, or into the nucleus. 
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52. The me" hod any one of claims 49-51, wherein the sorting 
signal or targeting sequence is selected from the group consis- 
ting of 

a nuclear localization signal (NL3) such as Pro-Lys-Lys-Lys-Arg- 
5 Lys-Val (5V40 large T antigen NLS ) , Ala-Arg-Arg-Arg-Arg-Pro 

(hunan retinoic a:id receptor - 3 NLS), Glu-Glu-Val-Gin-Arg-Lys- 
Arg-Gln-Lys-Le j (N'FkB p50) , Glu-Glu-Lys-Arg-Lys-Arg-Thr-Tyr-Glu 
(NFkB p65), and Ala-Val-lys-Arg-Pro-Ala-Ala-Thr-Lys-Lys-Ala-Gly- 
Gln-Ala-Lys-Lys-Lys-Lys-Leu-Asp (Xenopus nucleoplasmin NLS); 

10 a membrane ancnoring sequence such as those derived from CDS, 
ICAM-2, IL-3, CD4, and LFA- 1 , and a lipidation sequence such as 
a tnyristylacior. or a palmi toylat ion sequence; 
a lysosomal sorting signal such as a lysosomal degradation 
sequence, and a lysosomal membrane sequencea mitochondrial 

15 localization sequence such as a mitochondrial ma:rix sequence, a 
mitochondrial inner membrane sequence, a mitochondrial intermem- 
brane space sequence, and a mitochondrial outer membrane se- 
quence; an endoplasmic reticulum localization sequence such as 
the sequence from calreticulin (KDEL) and the sequence from 

20 adenovirus E3/19K protein (1U1SRRSFIDEKKMP) ; 

a peroxisome sequence such as the peroxisome matrix sequence 
from Luciferase; 

a f arnesyiat ion sequence such as LNPPDESGPGCMSCKCVLS ; 
a ge rany lge ran y 1 a t i on sequence such as LTEPTQPTRNQCCSN ; 
25 a destruction sequence such as RTALGDIGN; and 

a secretory signal sequence such as the secretory signals from 
IL-2, growth hormone, preproinsulin, and influenza HA protein. 

53. The method according to any one of the preceding claims, 
which further comprises the step of resolving the 3 -dime r. s ion a 1 

30 structure of the identified modulator. 

54. A method for the preparation of a replicable expression 
vector, the method comprising the steps of identifying a modula- 
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tor cy the method according to any one of claims 1 



-53, and sub- 



seouentlv 



no or synthesizing a .nucleic acid sequence which 



entodes the modulator, and 



5 



li) 



engineering a rep li cable expression vector comprising an 



ope r on which comprises, in the 5' -3' direction and in 
operable linkage, 1) a promoter for- driving expression of 
the nucleic acid sequence, 2) optionally a nucleotide 
sequence encoding a leader peptide, 3) the nucleic acid 
sequence, and 4) a termination signal. 



55. The method according zo claim 54, wherein the promoter is 
inducible . 

56. The method according to claim 55, wherein the promoter is 
selected from the group consisting of a bacterial promoter, a 

15 fungal promoter such as a yeast promoter, and a mammalian pro- 
moter. 

57. A method according to any of claims 54-56, wherein the 
expression vector is a plasm id, a phage, a cosmid, a mini chromo- 
some, or a virus. 

20 58. A method according to any of claims 54-57, wherein the 

expression vector is capable of being integrated into the genome 
of a suitable host cell. 

59. A method according to any of claims 54-58, wherein the 
nucleotide sequence is incorporated into a vector selected from 

25 the group consisting of pBR322, phage-X and YRp7. 

60. A method for the preparation of a transformed cell carrying 
a nucleic acid sequence encoding a modulator as defined in any 
one of claims 1-53, the method comprising transforming a suit- 
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able host ceil with an expression vector prepared according to 
any one of claims 54-59. 

61. A neinod for providing a modulator as defined in any one of 
claims 1-53, the method comprising 

5 1) growing a transformed cell prepared according to the 

method of claim 60 in a culture nedium under conditions 
which facilitates expression by the cell of the randomly 
modified nucleotide sequence, and 
TI) subsequently harvesting the expression product from the 
10 cell and/or the culture medium, or 

la) identifying the modulator according to the method of any 

one of claims 1-53, and 
lb) subsequently synthesizing the modulator by means of chemi- 
cal synthesis on the basis of the sequence determined in 
15 step ( e) . 

62. A method for isolating a target biomoiecuie, the method 
comprising providing a modulator according to the method of 
claim 61 and subsequently using the modulator as an affinity 
iigand in an affinity purification step so as to isolate the 

20 target biomoiecuie from a suitable sample. 

63. The method according to claim 62, wherein the affinity 
purification step is an affinity chromatographic step, an affi- 
nity mass spectrometry step, or a co- immunoprecipi tation step. 

64. A method for isolating a target biomoiecuie, the method 

25 comprising providing a peptide modulator according to the method 
of claim 61 and subsequently using the modulator as a probe 
against a cDNA library derived from the substantially identical 
cells or using the modulator as bait in a two- or three-hybrid 
system . 
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,35. The method according to any of claims 61-63, wherein the 
tar. get bicmoleoule is a peptide :r a nucleic acid. 

66. The method according to c.;i:ti 64 or 65 farther comp r i s i ng 
-,nc step of determining the ammo acid sequence of the peptide 

5 or de:err.:r.irc the nucleotide sequence of rhe nucleic acid. 

67. The method according to clan 66, further comprising the 
step of resolving the 3-dinens iona 1 structure of the target 
biomslecule. 

68. A method for selecting a chemical compound as a lead com- 
10 sound m drug development, the method comprising the steps of 

assaying a library of chemical compounds for interaction 
with a target bicm.clecule which has been de novo isolated 
according to :. he method of any one of claims 62-67, and 
selecting compounas which interact significantly with the 
15 target biomolecuie. 

63. The method according to claim 68, wherein the library of 
chemical compounds has been providec by chemical synthesis upon 
initial identification of the members of the library by 
structure-based or nor.-st ructure based computer drug-modelling. 

20 70. A method for the preparation of a medicinal product, the 
method comprising the steps of 

A) selecting a lead compound by the method according to claim 
68 or 63, 

B) performing pre-ciinicai tests with the lead compound in 
25 order to assess the suitability thereof as a medicinal 

product , 

C) entering, if the lead compound is deemed suitable in step 
(B) , clinical trials using the lead compound in order to 
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obtain market authorization for a medicinal product in- 
cluding the lead compound as a pharmaceut ica 1 ly active 
suostance, ana 

D) upon grant of a market authorization, admixing the leaa 

5 compound wirh a pharmaceut ica 1 1 y acceptable carrier, 

excipier.t or diluent and marketing the thus obtained 
medicinal product. 
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